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Abstract : We study the reconstruction and particle identification (PID) problem for Ring Imag¬ 
ing devices providing a good knowledge of the direction of the Cerenkov photons, as the DIRC system, 
on which we specialize. We advocate first the use of the stereographic projection as a tool allowing 
a suitable representation of the photon data, as it allows to represent the Cerenkov cone always as a 
circle. We set up an algorithm able to perform reliably a fit of circle arcs of small angular opening, 
by minimising a true expression. The system we develop for PID relies on this algorithm and on a 
procedure able to remove background photons with a high efficiency. We thus show that, even when 
the background is large, it is possible to perform an efficient PID by means of a fit algorithm which 
hnally provides all the circle parameters ; these are connected with the charged track direction and 
its Cerenkov angle. It is shown that background effects can be dealt without spoiling significantly the 
reconstruction probability distributions. 
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1 Introduction 


Cerenkov Ring Imaging devices settle specific problems of pattern recognition. Generally, 
devices are designed in such a way that one has to look for photons projected onto a circle 
ii, the radius of which being simply connected to the Cerenkov angle of the charged track 
radiating these photons. In RICH detectors (as for DELPHI), the main problem is due to 
background photons which come together with the real signal photons and render difficult 
pattern recognition, basically because the number of independent constraints which can be 
used is small. Traditionally, current methods in pattern recognition follow more or less the 
ideas from Baillon 0, assume the knowledge of the charged track information and perform a 
maximum likelihood £t to the number of background and/or signal photons, under the various 
mass assumptions. 

It is not of current knowledge to attempt a full fit of the ring parameters, mainly because 
background photons are expected to spoil the £t quality, and anyway prevent to define a 
reasonable to be minimised and then a £t probability having the properties expected for 
a probability distribution. This is mainly due to the difficulties encountered when trying to 
select a set of photons associated with a track and possibly still affected by contamination from 
background photons. 

Motivated by the DIRC device 0, |^, which will be used by the BaBar Collaboration [Q 
at the SLAC PEP-H B-factory, we reexamine this problem with the goal of associating to a 
given charged track, a set of photons with a low level of contamination, having in mind to be 
able to check in each photon sample the effect of a residual background contamination. This 
provides several problems to be solved in order to have a procedure able to run under realistic 
conditions. 

In a DIRC device, one can dehne with a controlled accuracy the Cerenkov photon directions. 
This allows to reconsider the problem of data representation, with the aim of having a well 
dehned geometrical hgure to look for. We advocate here that the stereographic projection 
provides such a suitable tool. Whatever are the errors, it allows to look always for an image 
of the Cerenkov ring which is surely a circle arc, without any special hardware request on the 
DIRC device considered. This is true even if the charged track direction is poorly known, 
possibly even unknown. In real life however, because of background, one cannot completely 
avoid some knowledge ~ even approximate and/or subject to systematic errors - of informations 
about the charged track direction. In this representation, the radius of this circle is connected 
in a simple way with the Cerenkov angle, and the circle center with the charged track direction. 
The stereographic projection has also been proposed as a tool for pattern recognition^ with a 
RICH detector which will use a 27 Kton water target and radiator in order to detect neutrino 
oscillation at Gran Sasso, using a long base line neutrino beam from CERN |]^. 

In working conditions, one is faced with another problem, specihc to the DIRC : generally, 
the part of the Cerenkov cone populated by observed photons represents a relatively small 
azimuthal window compared with 360°. In any representation of the data, this corresponds to 
a small circle arc populated with measured points. Taking into account the relative magnitude 
of the errors affecting these points and the circle radius, this generally prevents a direct use 
of standard circle fit algorithms. Indeed, in such conditions, the fit quality is poor, even in 

^We thank T. Ypsilantis (INFN, Bologna) for this information and for having drawn our attention on the 
corresponding references. 
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absence of background, as the radius is systematically underestimated. In order to circumvent 
this difficulty, a further information has to be used and it happens that the charged track 
direction is a good candidate for this purpose. This is basically the idea of Ref. but in 
a completely different context. Indeed, one can always assume that we have some knowledge 
about the charged track direction provided by some tracking device located in front of the 
DIRC. However, this implies a deep change in the circle £t algorithm structure. The pattern 
recognition problem of the long base line RICH of Ref. 0, even if not trivial, is much simpler 
than for the DIRC as the circle arc to reconstruct is generally quite large and the errors affecting 
photon directions are much smaller. 

In this way, one is led to introduce as measurements to be £t the angular distances between 
each of the various photon directions and the charged track direction. This introduces strong 
correlations which have to be carefully accounted for when constructing the expression for the 
to be minimised. These correlations are partly due to the measurement errors on the charged 
track direction and partly due to multiple scattering effects undergone by the charged track 
inside the radiator ; this last kind of correlations can be important in identifying tracks of very 
low momentum (typically below 1 GeV/c) and they survive even if the circle arc populated by 
photons is large enough that we need not worry about circle center information. 

Finally, having dehned appropriate tools for pattern recognition and particle identihcation, 
it remains to dehne a procedure able to provide a set of photons really associated with the 
track. As the number of constraints available for photon recognition is small, tools allowing to 
control background contamination effects in this sample are needed. 


The paper is organised as follows. In Section |^, we briefly outline the DIRC structure 
and properties relying on known literature 1 . 0 . in its aspects relevant to reconstruction and 
particle identification. In Section we recall the properties of the stereographic projection and 
the connection with Cerenkov cone reconstruction. Section ^ is devoted to describing the circle 
£t algorithm and recognition procedure when the circle arc populated by measured points is 
small ; we also sketch here how to deal with photon contamination control and background 
removal. In section ^ we describe the Monte Carlo we have coded in order to check the full 
procedure ; all effects affecting the recognition procedure are included and varied from minimum 
(to check the basic model properties) to a maximum (photons produced by several tracks in 
the same bar, with or without additional flat background). The procedure of photon selection 
and background removal is described in details in section |^, it is more specihc to the DIRC 
problem than the £t itself. This procedure relies on using hrst a clean subsample of photons 
(unambiguous photons) to start an iterative recognition procedure. In section we describe, 
the working of the procedure and show that it behaves well, even under very large background 
conditions ; the effects of correlations are specifically illustrated and it is shown that they 
cannot be neglected. Of course, photon selection relies on cutting out photons candidates ; 
then. Section ^ illustrates how cut levels can be adjusted and tuned in such a way that the 
probability distributions are not too much affected. We thus show that pull distributions and 
probability distributions allow to perform the background removal with a controlled quality. 

Finally, two appendices contain a full treatment of the multiple scattering effects and cor¬ 
relations. We describe here also our modelling, where all effects are taken into account at hrst 
order only. Monte Carlo results show that the effects of this approximation are small in the 
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region where a specific particle identification is expected to be reliable with a DIRC device, i. e. 
above 500 MeV/c. 

2 Outline of a DIRC Device and Properties 

The DIRC (Detection of Internally Reflected Cerenkov light) is a new type of detector which 
will be used as the main particle identification system in the BaBar experiment P] at PEP2 
collider (SLAC). A prototype of the DIRC P has been extensively tested in the CERN PS beam 
in 1995/1996, demonstrating the validity of the DIRC concept. It will not be described here in 
full details. We will rather briefly outline the general principles involved in its conception and 
operation, more information can be found in recent literature on the BaBar DIRC (see Refs, , 
1^ and 1^). We mainly limit ourselves to the aspects of relevance for the procedure we develop 
and the Monte Carlo we will use in order to test it. 


The originality of a DIRC device is that, contrarily to most other Cerenkov ring imaging 
detectors, it makes use of the Cerenkov light generated in the radiator medium by trapping 
photons (through total internal reflection) into the radiator itself and guiding them toward a 
set of photomultipliers (PMTs) for detection ; this also allows the detector to be quite thin 
in the direction of the incoming tracks, because the Cerenkov cone expands outside the main 
sensitive area of the detector. 

In the DIRC, the radiator is made of long rectilinear fused silica bars of modest rectangular 
section, a material chosen mainly because of its high refractive index (1.474) p, § compared 
to air or nitrogen and its long absorption length in the UV region. As sketched in Fig. |l|, the 
quartz/air interfaces of the bars act as perfect mirrors for a wide range of Cerenkov photons 
incidence angles, and, for a sufficient optical and geometrical quality, they are able to transport 
the photons to the bar exit, with unperturbed directions except for reflection symmetries with 
respect to the bar faces. 

The number of photons produced in the quartz bars depends on several parameters. Inter¬ 
estingly, it tends to increase with the incidence angle (since the path length inside the radiator 
medium grows), which is a behavior rather opposite to more traditional ring imaging detectors, 
where a large part of the Cerenkov light is lost because of the internal reflection inside the 
radiator. 

To avoid loosing too much light at the bar exit, the array of photomultiplier windows and the 
light quartz bar exits are immersed in pure water, which refractive index (1.34) is close to the 
quartz one (1.474), so that photons crossing the quartz/water interface have a low probability 
of being reflected back into the bar. 

There is also a reflecting device parallel to the bottom bar surface put at the bar end (in 
the water) in order to redirect toward the PMTs the photons emerging from the bar with either 
downward or too upward going directions ; the other bar end is equipped with a small mirror 
in optical contact with the quartz for the same purpose. 

Once the Cerenkov image is detected on the PMTs array, two informations are available : 
the location of the hit PMT and the photon detection time. This image (see Fig. ^ for an 
example from the DIRC prototype tested at CERN [^) is in fact a superposition of different 
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reflections of the original Cerenkov cone by the bar reflecting snrfaces. So, a given hit can 
belong to the Cerenkov cone centered on the original track direction, or to any of its images 
with respect to the reflecting snrfaces of the bar. In the case of the prototype of Ref. p, 8 
reflected photon directions (dne to 3 possible symmetry reflections planes) are to be taken into 
acconnt. This ambignity problem is specihc to the DIRC : given a dehnite photon direction, 
there are as many track-photon associations as reflection planes and it may be that several 
combinations are physically acceptable. Fortnnately, it may also happen that only one solntion 
is acceptable ; this dehnes nnambignous solntions. Fig. | also illnstrates that the arc popnlated 
by the hits can be relatively small (of the order of 60 degrees). 

Using the spatial information, one is able to reconstrnct nniqnely the original Cerenkov angle 
of a photon (which is emitted on the Cerenkov cone) if the ambignities related with the varions 
possible reflections that conld affect the photon dnring its propagation can be disentangled. 
The timing information is mainly nsed for a preliminary recognition of backgronnd photons, 
generally ont of time for a given track, as the information provided by the individnal photon 
detection time is very poor compared to the spatial information given by the PMTs. 


Thns, a possible strategy for pattern recognition in the DIRC is effectively to try dis¬ 
criminating spatially between photon ambignities in order to determine the correct symmetry 
assignment, and then nse the set of non ambiguons resnlting photon Cerenkov angles (from 
the whole image) in order to compnte the relevant qnantities we are interested in : the track 
Cerenkov angle and its error. This is achieved throngh a htting procednre, and these infor¬ 
mations allow in a second step to rehne the choice among the ambiguons photon solutions left 
provisionally aside, and take part of them into account in a hnal £t. Among the difficulties 
that are to be met, the smallness of the arc length populated by photons should be noticed and 
has to be especially addressed. 

In the following sections, we detail the different parts of the algorithm implemented in order 
to achieve this goal. 

3 Pattern Recognition Using the Stereographic Projec¬ 
tion 

In a transparent medium of index n, Cerenkov photons are emitted by a charged particle 
with an angle 6c with respect to the charged track direction and this angle is given by : 

cos9c = ^ ( 1 ) 

njj 

where (3 is the speed of the particle. As the photon wavelength^ is generally not measured, this 
turns out practically to assume that the refractive index n is known with a random error 5n^ 
independently for each photon. 

Let us associate to each of the charged track and photon directions a unit vector, and draw 
all of them from a common origin denoted (0,0,0). All end points of these vectors lay on a 

^ We postpone to Appendix A commenting on the influence of chromaticity fluctuations (dependence of n 
on the photon wavelength A) which affects each photon direction. 


5 



unit sphere, and all photon directions generate a cone of half aperture 6c around the charged 
track direction. From now on we shall always refer to photon and charged track directions only 
in this representation. 


3.1 The Stereographic Projection 


The intersections of the photon directions with a plane perpendicular to the charged track 
direction at unit distance from the origin dehne a set of points distributed along a circle of radius 
tandc, centered at the intersection of the charged track direction with this plane. However, if 
the actual charged track direction is not the one chosen in order to dehne the plane, the hgure 
represented by the intersections of the photon directions with this plane is no longer a circle but 
an ellipse, and departure from a circle may become large if the charged track direction is poorly 
known0, because of systematic errors in the measurement of the charged track direction, or 
misalignments effects. 

A way to circumvent this problem (or minimize it at least) is to use the stereographic 
projection]^ [|T^]. Let us briehy recall it. Let us choose on the unit sphere dehned above the pole 
axis along the measured charged track direction. The stereographic projection of points on the 
sphere is the intersection of the lines joining the south pole (0, 0, —1) to these points with the 
equatorial plane. In this transform, a circle drawn on the sphere {i.e. the intersection of the 
Cerenkov cone with the sphere) is projected out as a circle. Then the Cerenkov cone centered 
along the charged track direction becomes a circle of radius tan6*c/2, centered at the origin. 
This origin is simply the image of the charged track direction, i.e. the projection of the north 
pole as seen from the south pole. This is sketched in Fig. Ref. 0 prefers performing the 
stereographic projection onto a plane tangent to the sphere at the north pole, rather than onto 
the equatorial plane ; correspondingly, the algebra is slightly modihed with respect to what will 
be presented just below. 

In practical applications, however, we only know approximately the direction of the charged 
track, and therefore the pole axis as dehned above coincides only approximately with the actual 
charged track direction. In order to illustrate what happens, let us assume that the actual 
unknown charged track direction makes an angle a, possibly large, with the pole axis {i.e. the 
reconstructed charged track direction). Then, by means of the stereographic projection, the 
images of the photon directions in the equatorial plane are still on a circle (see Fig. the 
radius R of which being : 

, 9 Ci 

a 1 + tan — 

R = tan- -Q — - (2) 

^ l-tan^ —tan^- 
2 2 

and the circle center is shifted from the origin to a point located at distance ro : 


ro = tan 


a 


1 + tan" 


Oc 


^ 1 — tan^ — tan 
2 


2 ^ 
2 


(3) 


^Basically, it is a standard conformal mapping of the sphere onto a plane, i.e. angles on the sphere are 
conserved in the projection. 
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close to the image of the actual charged track direction, which is located at tana/2. 

It is clear from rels. and that the error on the circle center is of hrst order in a, 
while the corresponding error on the radius is only of second order. Therefore, the robustness 
of the stereographic projection follows from the fact that the analytical shape of the Cerenkov 
hgure in the equatorial plane is always a circle, even if the actual charged track direction is 
quite different from the measured one. Moreover, the radius is affected at second order only by 
angular errors on the charged track direction. If the angle a were large, it is clear from Reis, 
d^) and (Q) that having determined by £t R and tq allows anyway to reconstruct the correct 
Cerenkov angle. Stated otherwise, the pole axis used in order to perform the stereographic 
projection may be chosen independently of any assumption on the charged track direction. In 
general, we have : 


tan^c = 


2R 


2R 


1 - R2 + , 


1-R2 


1 - 


1-R2 


+ 0(4 


(4) 


If To is small enough, R is negligibly affected^ but can easily be corrected. 

The correspondence between the coordinates of a point on the unit sphere {X,Y,Z), and 
those of its image (x, y) into the equatorial plane (through the stereographic projection) is 
dehned by : 


X = 


X 

1 + Z 


Y 

1 + Z 


(^2 + y 2 ^ ^2 ^ 1 ) 


This transform is never singular in our case as we have always Z > 0. 


(5) 


3.2 Handling Measurements and Errors 

Let us assume that the above coordinates X, Y and Z (or any other quantity) are measurable 
quantities with known covariance error matrix, originating from random distributions X, Y and 
Z ; assuming the measurement process unbiased, we can write X = Xt^ue + where Xt^ue 
is the true (unknown) value of X and 6X a random variable of standard deviation ax and of 
zero mea nQ: 


r < >= 0 

1 < [5X]2 >= 4 


( 6 ) 


Correspondingly, for any other measurable quantity, we dehne a centered error function carrying 
the corresponding standard deviation {ay or az, for instance). As we assume the measurement 
process unbiased, we should have indeed < X >= Xtme- Using this language, true and mea¬ 
sured values are quantities which differ by first order terms 0{6X). 

The error functions affecting the coordinates x and y (in the equatorial plane) can be derived 
by differentiating Eqs. (j^) : 

9c = 500 mr, in order that 2 arctani? provides an overestimate of at least 0.1 mr, the error on the charged 
track direction should be, at least, a ~ 30 mr. 

^ We shall always use the notation < / > for the expectation value of any random variable /. 
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(7) 


Sx = 


1 + Z 


-SX 


X 


(i + zy 


sz 


Sy = 


-SY- 


Y 


rSZ 


l + Z"* (1 + ^)2 

When estimating errors using these expressions, X, Y and Z should be the corresponding 
true values. As they are unknown, one classically uses instead the measured central values. In 
terms of differentials, this lack of information affects second order terms (like (hX)^). So, at first 
order, it is legimate to use directly measured values while estimating coefficient functions. It is 
clear that going beyond first order development in analytic expressions would raise a problem 
here, as the additional terms would be competing with the (uncontrolled) terms introduced by 
using the measured values instead of the true ones. 

If the measurement (X, Y, Z) is unbiased, the point {x, y) is unbiased too at leading order 
(he. < Sx >=< Sy >= 0). The covariance terms (< [Sx]^ >, < [Sy]"^ > and < SxSy >) can 
be computed in terms of X, Y and Z and of their errors and correlations ; when computing 
them, one has to take into account that X^ + = 1 and that their error functions are 

not independent : XSX + YSY + ZSZ = 0. 


4 A Circle Fit Algorithm 

As explained in section using the stereographic projection, the directions of the Cerenkov 
photons emitted by a charged track are represented by points in a plane laying on a circle. 
Up to second order terms, the circle center is nothing but the projection of the charged track 
direction onto the (equatorial) plane of the sphere. Therefore, the reconstruction problem of 
the Cerenkov angle is replaced by a problem of circle recognition in a plane. 

It is a long standing problem to find the most suitable way to perform a circle fit to a given 
set of points affected by measurement errors (see refs. 011,11 for instance). The main 
problems addressed in (necessarily) approximate procedures are : 

• linearisation of circle parametrisation 

• non-gaussian character of the errors on the circle center coordinates and radius. 

In addition to the above mentioned questions, we address two more issues, connected with 
the BaBar DIRC, namely : 

• in any representation, the measured points are not spread out onto the whole circle, 
but along a relatively small arc (about 60° degrees). Taking into account the relative 
magnitude of the error on the points and of the circle radius value, this happens to affect 
deeply the circle fit quality, if no additional information on the circle center is accounted 
for. 

• there exist correlations among photons as a consequence of the multiple scattering un¬ 
dergone by the emitting charged track. Accounting for further constraints (charged track 
direction measurement) may introduce further correlations (see Sections 2 and 3 in Ap¬ 
pendix B for instance). 
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4.1 The for Fitting a Circle Arc 

Let us assume that we have measured points {xi,yi) with random errors {Sxi,6yi), not 
necessarily gaussian. As we restrict our study to a DIRC device where the points are actually 
photons, we shall use indifferently the words photon and point. We do not state any assumption 
on these errors, except that the measurements are unbiased : 

< 6xi >=< 6yj >= 0 , yi, j = 1, ■ ■ ■ (8) 

Stated otherwise, the expectation values < 6xi 6xj >, < 6xi dyj > and < 5yi 6yj > which 
define errors and correlations are not constrained and no further assumption is needed on higher 
order moments. In the approach we have followed, the effects of multiple scattering are not 
affected to the photons measurements but rather to the track direction. 

We assume that we have, beside the circle points, also a measurement of the circle center 
coordinates and its error ; we define our reference frame in such a way that this measured center 
is located at the origin. The circle parameters we fit are a and b (the center coordinates) and 
R (its radius). This parametrisation is examined in full details in Appendix B. 

The f o minimise in order to get the best circle fitting points is defined by (see Appendix 
B 4) : 


with : 


= ^(d, - R)id, - R)Vr^ + WS-'A 


(9) 


di = ( 10 ) 

where a and b are the circle center coordinates parameters (measured as (0,0), up to errors) 
to be fit. We have denoted by A the vectoif] (a, b) of the center coordinate and by A* its 
transposed. The error matrix E depends on the reconstruction errors of the track at the DIRC 
entrance (Eq), on the number of photons which take part to the fit {n^) and on the multiple 
scattering undergone by the charged track inside the radiator ; it is explicitly computed in 
Appendix A and in Sections 2 and 3 of Appendix B. 

In usual approaches this second contribution to the is not considered 0,00 ; however, 
when the circle center is constrained by an auxiliary measurement, it is legitimate to use it. 
Moreover, it is harmless to remove it only if the circle length populated by the measured points 
is large enough (typically greater than 180°) and/or if < [5di]‘^ >/di is small enough. In the 
case of the DIRC, none of these conditions are practically met, and removing the constraint 
on the center represented by the second term in the RHS of Rel. (^ may simply lead to a 
complete failure of the circle fit, even in absence of fake photons. 


The matrix V which appears in Rel. (P) is the error covariance matrix. It is also the matrix 
of the error function expectation values : 

®Actually, the vector (a, b) should be written (a — ameasured, b — bmeasured) ; however we take into account 
that the measurement has been conventionally set at (0,0). 
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Vij =< 6di 6dj > , Vi, j = 1, • ■ ■ 

The error function Sdi affecting the measurement i is given by : 


( 11 ) 


Sdi = 


Xi{Sxi + + yijSyi + Sbj) 

di 


( 12 ) 


(see Appendix B4) up to higher order terms. In this expression, we use as central values for 
the circle center coordinates the point (0,0), while the error functions on the circle center are 
estimated for each photon. It is the reason why the error functions which appear in Rel. ([l^) 
are Sai and Sbi, referring to the charged track direction when it emits photon i and Sxi and 
Syi are the measurement errors of the direction of photon i. The form of the error functions 
for Stti and Sbi is given in Appendix Bl, using preliminary results from Appendix A. The form 
of the error function Sdi is explained in Appendix B4 ; the elements of the matrix V in Rel. 
( 0 ) are computed in Section B 5. One can see there how correlations terms like < SatSaj >, 
< SttiSbj >, ...produce non zero correlation terms in < SdiSdj >. Consequently, one can 
interpret a and b in Rel. ([I0|) as the mean values (to be fit) of the sets {aj} and {&*}. Actually, 
the single information - except for errors - we have on these sets are the measured values at 
the bar entrance ; how this is accounted for is explained in details in Appendix B. 


4.2 Linearisation of the Circle Parameter Equation 

It remains to linearise Eq. (||). The procedure is quite usual [^, |T^, |T^ and turns out to 
replace Eq. (^ by : 


X 


2 





ml 




(13) 


where Rq is an estimate of the radius (a weighted mean of the di at start and, in the forthcoming 
steps of the iteration the fit value of R at the previous step of the iteration). The matrix V 
depends on the circle center coordinates. They are chosen here at start as (0, 0), and can also be 
updated in the forthcoming steps of the iteration procedure. The final step toward linearisation 
is to define as fitting parameter c = R^ — — b"^ instead of R, together with a and b. Then Eq. 

becomes : 


X 




< ^ij — 4-Ro ^ij (14) 

Ci = X- + yf - 2axi - 2byi - c 

and this last expression for is quadratic in a, b and c. Eqs. ( P^Bf ) and ( P!4[ ) simply follow from 
the fact that near the minimum we have : 


{di - R) 


{di + R) 2Rq 
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This relation fastly improves when iterating, and few iterations only are needed in order to be 
at less than 10“^ from xlnin (generally 2 steps are enough for the accuracy just quoted). 

The conditions defining the minimum are : 


dx^ 


dx^ 


= 0 = 0 ^ = 0 ( 18 ) 

da ob oc 

and provide a linear system of equations for a, b and c which gives an optimum solution to the 
minimisation problem. On the other hand, denoting the variables a, b and c by Ua, (a = 1, 2, 3), 
Eqs. (^) can be written (summation over repeated indices is understood) : 


X TafjUoiU T Z(yUa T K (16) 

where the matrix T, the vector Z and the scalar K can easily be expressed in terms of the 
(given) W and S matrices and of the photon coordinates ([xi,j/i],i = moments. In 

addition, the matrix T~^ is the error covariance matrix for the fit parameters {ua,a = 1,2,3) 
. This covariance matrix gives the error contour at Xmin + I; 1*^ contour. 

The number of degrees of freedom associated with the in rel. (^), is — 1 and then the 
fit probability is the value of the probability function Pr(y^,n.y — 1). One can also define a 
consistency check of the set of photons under consideration with the set of track parameters : 
the charged track direction providing the expected coordinates of the circle center oq = 0 and 
6o = 0 (the measured values) and the -five- possible values of the circle radius corresponding 
to the -five- possible mass assignments for the charged track, Rk {k = 1, • • - 5). In this case, 
the y^ simplifies to : 


16 


Ci = I? +!/? - Co 

where cq takes five possible values cq = Rl, each corresponding to one of the possible values 
of Rk = tan 0^/2. In this way, one can check the consistency of the set of photons considered 
with the measured charged track parameters (oq = 0, 6o = 0, Rk) for each of the five possible 
mass assignments. The y^ just above corresponds exactly to degrees of freedom. One can 
then decide to choose the best assignment as being the one which corresponds to the lowest y^, 
provided it is above some significance threshold (a lower probability cut). We illustrate in the 
following that the corresponding probability distributions have all expected properties. 

4.3 Fit Likelihoods, Fake Photons and Contamination 

Let us denote Xn^-i and y^_^, the y^ defined resp. by Reis. (0 and (Eli)- Given a set of 
photons, it is clear that, using standard formulae, the former y^ allows to define a maximum 
likelihood including the center measurement, while the latter leads to the likelihoods of the full 
measured track with the full photon set considered, for each possible mass assignment. 


11 










In practical applications, however, it should be noted that dehning the true (maximum) 
likelihood, implies that : 

• the photons considered are indeed photons, 

• the photons are actually connected with the track considered, 

• the photons errors are correctly estimated. 

This sets the problem of background photons. Indeed, in the DIRC, actual photons have 
errors which are well approximated by gaussians and can be computed, more or less accurately, 
with known information (geometry, chromatic errors, ...). If an observed hit is not an actual 
photon (noise) or if it is an actual photon but emitted by another track, possibly from an¬ 
other quartz bar, its error distribution (and its standard deviation) are completely unknown ; 
therefore, their actual contribution to any cannot be estimated with any controlled accuracy. 

Stated otherwise, any procedure aiming at providing a reasonable estimate of the prob¬ 
ability (or of any likelihood), should remove background photons from the photon sample kept 
in the £t and the x^ estimates. On the other hand, it is clear that, in presence of noise, any 
estimate of the x^ is altered, except if one would be able to remove background from the photon 
set at the 100% level, which is generally hopeless. 

Fortunately, the level of actual contamination in any photon set left by any cleaning up 
procedure can be checked statistically. Indeed, as can be seen from Reis. ([I3| ) - (pT]), the fit 
solution found for R (denoted here R/u) and its error crucially depend on the photons used, 
their errors and their mutual correlations, all informations which can be accessed with a good 
accuracy. If the photon set is too much contaminated, the fit solution may depart signihcantly 
from the expected solution. 

The pull {Rfit — Rtrue )/ctr should follow a centered gaussian law of unit standard deviation. 
This property provides a quantitative criterium to test the quality of background removal. 
Indeed, the most likely effect of background is to shift Rfu from its expected value ; this is 
reflected in the pull distribution by a shift of its mean value from zero and an increase of its rms 
with respect to 1. The magnitude of observable departures from the standard pull expectations 
clearly signals a more or less acceptable level of contamination, as soon as a correct pull behavior 
has been ascertained with noise-free samples. Therefore, checking the model with noise-free 
samples is an unavoidable step in constructing the procedure. 

When running with Monte Carlo data, all checks can easily be performed ; when running 
with real data, several checks are still possible using selected events samples like pions from 
Kg decays or protons from A decays, which can be selected using kinematics only, i.e. inde¬ 
pendently of the DIRC. Assuming that background conditions are not especially dependent on 
the existence of such particles in the events, the quoted pull can be plotted and the influence 
of background contamination inferred. 

4.4 Pattern Recognition Using a Circle Fit Algorithm 

In practical applications we have a collection of hits that are associated with Cerenkov 
photons emitted by a given track. These hits are of three different kinds : 

• photons emitted by the track under consideration 
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• photons emitted by other tracks than that under consideration 

• background hits associated with electronic noise of photomultipliers or to unidentihed 
tracks (which practically can be merged with noise). 

Hits of the hrst kind have generally well behaved error functions (not far from gaussians, 
if not exactly gaussians), while the other two kinds of hits have practically unknown error 
distributions and should be removed in order to give a statistical meaning to the circle £t. 
Stated otherwise, the photon sample must be cleaned up. There is also, specific to the DIRC 
device, another category of fake photons (the ambiguity problem) which will be addressed in 
Section ^ 

Basically, the cleaning procedure of the photon sample relies on the fact that photons, actu¬ 
ally emitted by the track considered, have error distributions which can be well approximated 
by gaussians. In this case, a suitable criterium in removing fake photons is to eliminate all hits 
giving a contribution to the greater than some maximum value. 

Therefore, the recognition procedure turns out to perform a £t with a starting sample of 
photons, in order to have an estimate of the circle parameters (a, b and R), then compute the 
distance of each photon to this circle, remove those which are too far, and restart the procedure 
with the surviving photons ; the procedure is repeated up to convergence. At this point, one 
can consider that the circle parameters are reliable and reexamine all possible photons in order 
to keep those which are at acceptable distance from the expected circle (typically less than 
about 9). In this way, one recovers ambiguous photons which were put aside in order to start 
the reconstruction procedure. Using this new enlarged sample, one can then perform a hnal 
circle £t and get the optimum circle parameter values. Practically, in the case of the DIRC, 
there are some subtleties which allow to improve background rejection and photon recovering ; 
they will be described with more details in Section 

5 Simulation of a simplified DIRC 

A fast Monte Carlo program was written in order to test the reconstruction algorithm. 
We have coded this program in order to output all needed information allowing to check the 
algorithm behavior in full details, which is not usually an easy task within a Monte Carlo 
simulating a complete experimental detector. This program simulates only one quartz bar 
and a PMT array plane (3 cm diameter PMTs packed in a rectangular lattice, located at 
approximately 120 cm from the bar exit). The angle of the PMT plane with respect to the bar 
axis could be chosen. The bar itself was 5 meters long, which is approximately twice longer 
than the DIRC prototype bar of Ref. [Q, but corresponds to the actual size of the bars of the 
BaBar DIRC P|. The distance between the bar exit face and the PMT plane has been chosen 
following BaBar DIRC setup. 


The simulation included most part of effects that could hamper DIRC performances: 

• Errors on track direction and momentum, supposed to simulate the response imprecision 
of a tracker in front of the DIRC bar. In current running conditions we have chosen 
(j{p)/p = 3. 10“^ and the generated angular errors have rms a{6) = cr(0) = 1 mr. 
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• Realistic detector geometry (geometrical uncertainties are important in the DIRC) 

• Chromaticity in the quartz radiator medium (dependence of medium index uq - and hence 
Cerenkov angle - on photon wavelength) ; it corresponds to choosing duq/uQ = 6 10“^. 

• The ratio g of the water to quartz refractive indices was treated as independent of the 
photon wavelength. This is close to the real situation, where 5g/g is typically 10“^. 

• Track multiple scattering in the quartz bar. 

• Number of Cerenkov photons emitted along the track proportional to sin^ Oq and to the 
charged particle path inside the radiator medium. The number of generated photons per 
centimeter has been chosen [^, |[ as iVo = 135. 

• The bar end opposite to the water tank bar exit window is treated as a mirror. 

• Full account of photon reflection and refraction properties inside the bar, providing a 
correct bar acceptance simulation. 

• The active part of the PMT plane has been truncated to half a plane, in order to prevent 
getting fully populated Cerenkov rings. 

In this simulation, the geometry and detector structure are incomplete (compared to the 
hnal BaBar DIRC P): there is no reflecting wedge ending the quartz bar at the water tank 
entrance (suppressing one reflection plane for photons) and geometrical imperfections of the bar 
(expected to be very small anyway) are not simulated. The PMT array here is a plane while 
in BaBar detector, it is umbrella shaped §]. It is packed as a rectangular lattice (while the 
BaBar packing is hexagonal, introducing in this way additional correlations among geometrical 
errors, however of limited magnitude). 

In addition, no interaction of the track with the bar is taken into account (except for multiple 
scattering), in particular no energy loss is implemented. The PMTs spectral response function 
is also not used in the simulation, except for computing the mean water and quartz refractive 
indices (and their dispersion). No photon absorption inside the bar is accounted for. Finally, 
there is no magnetic field effect. 

To summarize, there is no conceptually important difference between this simulated setup 
and that of BaBar and, instead, all relevant features are accounted for. 


The main data sample used for this study is a set of 5000 single electron, muon, pion, 
kaon, and proton tracks having a large bar incidence angle and momentum range (20° < dine < 
70°, 0.5 < P < 5 GeV). This sample could be used in several ways : in normal mode, one 
event contained only a single track ; another mode of operation allowed to superimpose several 
tracks in one event, for physics background studies ; a last possibility was to add a random 
and (spatially) uniform noise to the event tracks, for other noise studies. These background 
conditions can thus be made quite severe compared to normal conditions. The quality checks 
of the whole procedure have been performed accurately by varying at will the magnitude of all 
errors, the background kind and level, and the phase space window. 
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The momentum range where the algorithm has been fully tested goes down to 500 MeV, 
practically the kaon Cerenkov threshold. At this momentum the angular error due to the 
multiple scattering for a pion is about 10 mr (for a bar thickness of 1.7 cm) this is quite 
comparable to the angular error due to the PMT window size (about 7 mr rms). Going to 
lower momenta is possible ; however, in this case the angular error due to multiple scattering 
may become dominant (for a 200 MeV pion, it is about 30 mr, far above the PMT geometrical 
error). In this case the procedure still works, however our hrst order estimates of the errors 
might become moderately accurate and higher order terms might become necessary^. 


6 Photon Selection and Background Removal 

6.1 Outline of the Procedure 

The linearized £t we have described needs, as input, photons with unambiguously dehned 
direction with respect to the track momentum : so, the main step of the photon selection 
procedure is to lower the number of ambiguities arising in the reconstructed photon direction 
due to the various reflection symmetries of our problem (bar surfaces, mirrors) ; ambiguities 
should be selected according to criteria which guarantee the symmetrized photon solutions 
correspond to possible or probable Cerenkov angles for the current track. Ambiguous photons 
cannot be used directly by the £t and so they are ignored during the hrst steps of the procedure 
(part of them will be recovered by a dedicated algorithm, see below). This ambiguity removal 
is actually performed in several steps using different criteria (detailed in section |6.2|) . 

The next step of the selection procedure is the removal of the possible background photons 
contaminating the unambiguous photons population ; an iterative cut procedure involving a 
median estimator is used for this purpose (section |6-4|) . 

The algorithm requires a minimum number of unambiguous photons in order to go on 
(typically 3, but this number can be lowered to 2). If there are enough unambiguous photons 
left, the parameters of interest (Cerenkov angle, coordinates of the center of the Cerenkov circle 
in the equatorial plane) are £t to this set of remaining unambiguous photons. This preliminary 
£t is made only in order to have a hrst approximation of these parameters and of their errors. 

A second and analogous ht is applied to a photon population built by adding to the primary 
set of unambiguous photons, photons which were originally hagged as ambiguous, but which 
have parameters not too far from the parameters resulting of the primary ht, according to 
the distance criterium . These additional unambiguous photons should also fulhll another 
condition which guarantees they are ’’unambiguous”, at least to some extent : for each new 
photon, the two ambiguities closest to the central parameters should be themselves sufficiently 
separated in the relevant distance (see section |t).5[ ). 

This last htting operation produces a new (and hnal) determination of the circle ht param¬ 
eters and errors. 


^Actually, as below ~ 700 MeV, the main PID device in BaBar is the drift chamber by means of the dE/dx 
measurements 1^, it does not seem useful to go to such complications. An overlap region of about 200 to 300 
MeV, where an optimum reconstruction can be performed with the DIRC and using dE/dx, allows already 
interesting cross-checks. 
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6.2 Ambiguity Identification and Removal 

6.2.1 Ambiguous — Unambiguous Cerenkov Photons 

In our problem, each hit recorded in the DIRC PMT array is associated with a recon¬ 
structed track. This primary association is made using straightforward criteria : for each hit, 
the 8 symmetrized Cerenkov angles are selected by requiring that they lay into a physically 
meaningful interval (typically, bounded by the Cerenkov angles corresponding to the extreme 
mass hypotheses). 

This step is important, because it controls the total number of PMT hits/tracks that will 
be examined by the full reconstruction algorithm, and hence influences signihcatively the time 
performance of the algorithm. After this primary simple association step, one PMT hit/track 
pair usually still gives rise to several Cerenkov angle solutions. These different solutions are 
called ’’ambiguities” hereafter : assigning a unique Cerenkov angle to a photon for one track 
turns out to discriminate between these ambiguities. 

Usually, this primary association step leaves an average number of approximately 2 ambigu¬ 
ities per hit, so that a large part of the photons are still ambiguous. To further select our sample 
of ’’solutions”, we restrict even more the allowed range of Cerenkov angles around the various 
mass hypotheses, through a cut in the |56*| = \9soiuUon — 9hypothesis\ variable computed for each 
solution Cerenkov angle. This cut is made typically at the level of 30 mr which corresponds to 
a rather large angular range around each mass assignment ; Fig. ^ shows that this window 
can be naturally dehned and checked on a track sample. Solutions found in the allowed range 
around at least one of the hve mass hypotheses are considered. As Ohypothesis is computed from 
the track momentum and a mass assumption, it is always in a valid range ; OsoiuUon however is 
computed from measured angles and mean refractive index, then, even for the correct solution, 
it may be physically meaningless (i.e greater than the maximum Cerenkov angle). Such values 
have nevertheless to be kept in order to prevent biasing distributions. 

Since other studies P] have shown that the main background in the DIRC are photons 
generated by other tracks in the same event, another cut is made, on the same grounds, to cope 
with this ’’inter track” noise in multitrack events : each PMT hit/track solution surviving this 
30 mr cut is tested versus the other measured tracks of the event to check if it can be associated 
with another track according to the same |56*| criterium (at the more restrictive level of 10 mr 
in this case, see Figure |b). The second |(56'| cut discards only a limited number of photons but 
an important number of background solutions. 

In order to be classihed unambiguous, each photon should give one and only one reflection 
solution relative to the track considered, such that < 30 mr and no solution such that 
\S6\ < 10 mr, relative to any other track in the event. 

The levels of the two cuts (30 and 10 mr) have been adjusted such as to correspond to 
a 3 a and a 1 cr cut; this means they have been kept at a quite loose level, leaving about 99% 
of the signal visible by the algorithm. Obviously, these criteria can easily be adjusted without 
a Monte Carlo, as illustrated by Fig. 

6.2.2 Perpendicular Tracks Recovering 

The previous classihcation of photons as ’’ambiguous” or ’’unambiguous” is not well adapted 
to the case when the track direction is almost perpendicular or parallel to some surface of the 
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quartz bar : in this case, trivial ambiguities appear systematically because of the intrinsic 
symmetries in the bar + track system, i.e. one allowed Cerenkov angle for this PMT hit/track 
pair will always generate two valid solutions per trivial symmetry plane, since these reflections 
have in fact (almost) the same Cerenkov angle. 

To cnre this problem, a narrow cnt on the incidence angle of the track with respect to 
the bar is made which, for this kind of tracks, rnns a procednre discarding systematically these 
trivial additional solntions, keeping randomly one of them. The magnitnde of this cnt is mainly 
related to the geometry of the qnartz bar + PMTs system, and in particular it is chosen qnite 
small compared to the angular size of one PMT as seen from the water side bar exit. In onr 
case, the PMTs have a diameter of 3 cm and are distant from approximately 120 cm from the 
bar exit : the corresponding angular aperture is around 25 mr (corresponding to abont 7 mr 
rms) ; the cnt is set at 7 mr. 

This cnt is slightly biasing bnt, as the trivial ambiguity kept is random, this bias is snrely 
limited. Monte Carlo stndies do not show any clear signal of bias related with this cnt, moreover 
the proportion of almost perpendicular tracks can be expected small at least for phase space 
reasons. 


After having applied the two |(50| cnts and the ’’perpendicular tracks” recovering cnt, photons 
which still allow for several Cerenkov angle solntions are declared ’’ambiguons”, and photons 
admitting one and only one solntion are ” nnambiguons”. The latter ones are used directly 
in the rest of the procednre. The three cnts already described are clearly independent of the 
particle kind. 

6.3 Cut on the Number of Unambiguous Photons 

After the preliminary step described above (step ”A”), a first cnt on the nnmber of nnam- 
biguous photons is made : we reqnire the £t to rnn with at least 2 photons as inpnt. In principle, 
one shonld reqnire 3 photons to be sure to keep a non singnlar 3x3 matrix in the minimisation 
procednre ; nevertheless, the constraint pnt on the circle center, allows to lower safely this limit 
to 2 inpnt photons. The mean ratio of the nnmber of nnambiguons photons to real photon hits 
(having survived the primary association cnts) at this point of the reconstrnction is aronnd 
55% ; this ratio obviously depends on the cnts one is nsing. 

When the nnmber of nnambiguons photons for the current event is lower than 2, in order 
to avoid loosing systematically the track, the algorithm tries repeating step A nsing a less 
restrictive \66\ cnt level which is lowered from 30 to 20 mr in several steps. This part of the 
algorithm concerns only a few percents of the events, and no significant bias could be traced 
back to it. The gain in efficiency of the algorithm after this ’’smoothed” \66\ cnt is aronnd 5% 
of events with two tracks. 

6.4 Background Removal using a Median Cut 

In the previous step, we have already performed the removal of some background effects. 
Indeed, the ambiguous photons represent a kind of combinatorial backgronnd, which is identified 
and left aside provisionally, awaiting improved informations on the Cerenkov circle from the 
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set of unambiguous photons. Moreover, the 10 mr cut on |56*| drops out already most of 
the contamination produced by photons associated with other reconstructed tracks, which can 
affect the unambiguous photon set associated with the correct track. 

After the construction of the unambiguous photons sample, it is possible to improve the 
removal of background photons, or, at least, to remove outlier photons. Further background 
may originate from several sources : 

• Photons created by unreconstructed tracks (low energy tracks produced in the detector 
materials, backsplashes from a nearby calorimeter, ...) 

• Improperly accounted for photon solutions (i.e. wrong reflection hypotheses considered 
as correct unambiguous solutions). 

• Background originating from the accelerator, PMT noise, cosmic rays ... is expected lower 
than the other types of background. 

The hrst type of background is event dependent and may have a complicated structure, 
whereas the second and third types are rather flat in Cerenkov angle space (at least on a large 
range around the nominal Cerenkov angle). However, we have found no signihcant difference 
in the reconstruction behaviour between them and thus, they are treated likewise. The second 
kind of background has been found relatively easy to deal with, at the expense of loosing for 
the £t strongly ambiguous photons {i.e. solutions which are consistent with more than one 
image of the Cerenkov cone, even with the improved accuracy allowed by the £t estimation of 
the circle parameters). These strongly ambiguous photons have not been used in the circle £t ; 
they can nevertheless be counted in order to improve the information on the total number of 
photons associated with the track. 


In any case, it is possible to cut out noise outside a window centered at the median Cerenkov 
angle (ideally, around the correct Cerenkov angle) of the set of unambiguous photons associated 
with a given track. The same window will be used afterwards as the area where the £t will be 
performed. 

The median is used here instead of the mean as central location estimator as it is less 
sensitive to noise and allows a better window setting (the ’’mean” location estimator is known 
to be non robust with respect to outlier tails, see for example iil,lll)- 

In conditions where the background is low (and/or mostly produced by other reconstructed 
tracks), the cut to be performed around the median corresponds to a gaussian cut| of about 3a ; 
in much harder background conditions this cut may have to be lowered to about 2a, affecting 
the probability distribution shape only in the low reconstruction probability region. 

The median cut procedure is iterated on the unambiguous photon sample till convergence 
is reached {i.e. until no new photon is removed), which usually happens after 2 or 3 iterations. 
The ratio of the number of photons, unambiguous at this point of the procedure, to photons 
having survived the first association step cuts is approximately 50%. 

®cr being defined by a theoretical computation of the gaussian errors. 
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In case of even higher background (like for example when many unreconstructed tracks are 
present in the event) the median procedure itself behaves poorly, since background photons 
may accumulate at several locations in Cerenkov angle space, simulating signal for almost all 
mass hypothesis. In this case, it is meaningless to rely on simple location estimators (like 
the median): in particular, the preceding procedure has an identification performance which 
depends on the track momentum when this one is close to the Cerenkov threshold of the 
various mass hypothesis. In such heavy background conditions, it is better to perform a fit 
only in restricted Cerenkov angle areas, and not on the full Cerenkov angle range: in this way, 
the response becomes more uniform. The choice beetween the two selection methods for the fit 
window depends on the noise level, and can be done automatically. 


After the median step, the sample of surviving photons is supposed to be free at least of 
the influence of unwanted tails which could spoil the subsequent fit. This preliminary fit gives 
a first estimation of the Cerenkov angle and the circle center in equatorial plane for our sample 
of filtered unambiguous photons. 

A last ’’cleaning” cut is applied after this first fit in order to remove outlier photons which 
have individual Cerenkov angles far from the fitted Cerenkov angle ; these photons could still 
exist and could degrade the results of the rest of the procedure. Typically, a fixed 3 a cut is 
applied for this purpose. 

After this last cut, the unambiguous photons represent about 45% of the actual population 
of photons associated with the track. The corresponding spectrum is shown in Fig. la- 

6.5 Ambiguous Photons Recovering 

It is possible to get more photons in the final fit after recovering part of the photons 
previously flagged as ambiguous, through a procedure using a criterium : those among them 
which have a contribution to the (estimated at the reconstructed Cerenkov angle and circle 
center computed by the first fit) which is not too high are declared unambiguous and included 
into the fitting sample, with the additional condition that their two best solutions are not too 
close to each other according to distance. 

Practically, this means that the photon solutions are not farther away from the first fit 
Cerenkov angle than typically 3 a and that the closest among the other solutions is beyond, 
typically, 3.5 or 4 a. 

This ’’double criterium allows to input in the second fitting step a sample of photons 
of the order of 65% of the original sample (under large background conditions) or about 80% 
(no additional background). The gain represented by the recovering procedure thus amounts 
to 50% to 100% of the photon number compared to the sample size before the recovering step. 
The proportion of finally used photons wrt the detected photons associated with a track is 
plotted in Fig. ^ for the sample with no additional background. 

One may imagine to replace this double criterium by a single one : keep the best solution 
provided it is below ~ 3cr, whatever is the of the second ambiguity solution. In conditions 
where the background is small, this increases significantly the number of photons - then, the 
accuracy on the reconstructed angle - and the reconstruction quality. In very hard background 
conditions, however, one has to study carefully to which extent the subsequent gain in photons 
does not degrade the reconstruction quality. It is not done here. 
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7 Monte Carlo Results 


The results presented in this section have been obtained using the set of cuts dehned in 
Section The numerical values of these cuts have been tuned depending on the background 
conditions affecting each of the Monte Carlo samples. We postpone to Section ^ the discussion 
on cut handling and tuning. Here we examine the results obtained from analysing these samples, 
in order to draw conclusions on the £t quality and the various aspects of background removal. 

Another important aspect is the algorithm performance for what concerns the separation 
power between the various mass hypotheses, more directly related to the use of the recon¬ 
struction for physics. Many ways to estimate this performance can be devised ; here we will 
discuss only simple criteria. Indeed, even if background effects are realistic in our Monte Carlo, 
and sometimes pessimistic, actual performances can be precisely dehned only with a detailed 
simulation of a given experiment or with real data. Moreover, the ’’performance” requested 
depends strongly on the physics goals one is willing to achieve. 

7.1 One Track Event Display 

In order to substantiate the problem of pattern recognition and background removal when 
dealing with the DIRC, it is not useless to display some events. For this purpose, we show in 
Figs. P and an event with one track (actually a kaon of 1.087 GeV/c momentum) superimposed 
with a low hat background (additional number of PMT hits at the level of about 20% of the 
signal PMT hits). The stereographic projection has been performed with the polar axis along 
the bar axis and then, the Cerenkov circles are not centered at the origin. Circles corresponding 
to the original track are drawn thick (the outer one looks even thicker as the e, /i and vr 
assumptions gives circles nearly superimposed), their images with respect to all symmetry 
planes are the shaded circles shown in only the upper Fig. The lower Fig. || displays the 
same region with only the circles associated with the original track direction under the various 
mass assumptions. The points represented are the solutions surviving the primary association 
cut (see Section above) ; the shaded area is the region of zero acceptance. 

Fig. H illustrates the task of the recognition and expresses as clearly as possible the usefulness 
of the charged track information ; most of the background shown is produced by ambiguities. 
This means that this event is relatively clean. The algorithm described above extracts the 
unambiguous photon subsample (see upper Fig. ^ ; in the present case all unambiguous 
photons are located along the circle corresponding to the kaon assumption. This is the ehect 
of the median cut referred to above ; indeed, by looking at Fig. |^, one clearly sees that 
unambiguous photons belonging to the proton circle (the innermost one) have been removed 
by the procedure. 

The subset of ambiguous photons is shown in the lower Fig. ^ ; the photon solutions which 
are to be examined by the recovering procedure belong to all mass assumptions. The extracted 
ambiguous photons which will be added to the unambiguous photon sample belong only to the 
kaon assumption circle. 
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7.2 One Track Events with no Background 

We first analyse single track events generated with no external backgronnd. This allows to 
stndy the algorithm performances and get qnality checks nnder optimnm conditions. The track 
momentnm range goes from 0.5 to 5 GeV/c and the sample contains eqnal nnmbers of e, /i, vr, 
K and p. 

No (flat) random noise is snperimposed to the event-tracks and no additional track embed¬ 
ded in these events, bnt this does not mean that they are backgronnd free. Indeed, the “com¬ 
binatorial” backgronnd represented by wrongly assigned reflection assnmptions to hits pollnte 
the sets of nnambignons photons. Therefore, we can check simnltaneonsly the behavionr of 
the algorithm and the removal of wrongly assigned photons. In this case, the median cnt for 
backgronnd removal can be pnt at a very loose valne (~ Aa) compared to harder backgronnd 
conditions. 

Fig. |]a shows that the reconstrnction probability is close to be flat : the mean valne 
is 0.52 (expected 0.50), while its rms is 0.277 (expected 0.288), both parameters being close 
to expectations for a flat distribntion. The bias prodnced by slightly overcntting in order 
to remove wrongly assigned (trne) photons is thns negligible. The reconstrnction qnality is 
illnstrated in Fig. ||b, where the Cerenkov angle pnll is represented ; it is very close to a 
centered ganssian distribntion of nnit standard deviation. This means that errors are well 
nnderstood and correctly acconnted for, inclnding mnltiple scattering handling. This also 
means that the identihcation is close to optimnnfl (pions were identihed at the level of 95% 
and kaon contamination was abont 4%). Finally, Fig. ||c shows the bias (6*^“^^ — 0^**) ; the 
mean bias fonnd is abont 0.3 mr, and the spread is abont 3 mr. The nnmber of tracks sent to 
the procednre was 5000, the fraction which was fonnd with at least 2 nnambignons photons, 
and thns reconstrnctedj^, was 88%. Therefore the single signihcant effect of the combinatorial 
backgronnd is to rednce by abont 10% the nnmber of events with at least 2 nnambignons 
photons. Nevertheless, if reconstrnction qnality has to be considered, tracks with at most one 
nnambignons photon (even when lowering the cut to 20 mr) look somehow suspicious. 


The reconstructed Cerenkov angle pulls are plotted for a few bins of track momentum in 
Fig. ^ : the rms of these pulls are quite stable as a function of the track momentum, and are 
close to 1. This stability implies that error estimation is correct, even in relatively small track 
momentum bins : the non ganssian behaviour of the Cerenkov angle distribution, even if not 
smoothed by an averaging over the track momentum, looks quite limited. There seems to be 
a systematic bias (at the 30% level) at low track momentum which decreases with increasing 
momenta ; this should be attributed to harder multiple scattering effects. Indeed, one should 
remark in Fig. |^a that, going to higher and higher momenta, allows at the same time to reduce 
the histogram bias to smaller and smaller values, while the rms become closer and closer to 1. 

Absolute deviations from the simulated Cerenkov angle are also plotted in the same track 
momentum bins on Figure showing the dependence of the errors on the track momentum, 
as noticed before. These plots also show that the general shape of the errors distributions in 
each momentum bin is correct, i.e. not far from a true ganssian. 

®Of course, as expected, the tt -/i-e separation is poor in the track momentum range explored. 

Among the 12% events lost, about 2.2% are protons below the Cerenkov threshold. 
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There is an analogous situation when one examines the dependence of the errors and biases 
of the reconstructed Cerenkov angle with respect to the track incidence angle on the quartz 
bar : in this case, Fig. [1^ shows clearly the existence of a systematic bias for tracks hitting the 
bar with a high incidence angle[^ This seems to corresponds to the already noticed effect with 
low momentum tracks, i.e. larger multiple scattering effect, but due to longer paths inside the 
quartz bar. Indeed, in this case, tracks with high incidence angles may suffer stronger changes 
compared with low incidence angles. The rms dispersion of the pulls looks here also quite stable 
when the track incidence angle varies, as demonstrated in the same hgure. 


Instead of htting the circle parameters on the photon sample extracted from data, one can 
decide to look for the probability (with degrees of freedom) by fixing completely the 
parameters to their values inferred from the charged track direction (see Rel. 0)- The result of 
this exercise is shown in Fig. o for true pions and kaons. The probability distribution happens 
to be flat under the right mass assumptions, while it becomes sharply peaked towards 0, under 
the wrong mass assumptions. Then, in addition to unaccurately reconstructed tracks, the very 
low probability bins may be enriched with tracks carrying wrongly assigned mass. It is usual 
to take this into account by a probability threshold ; in the present case, a cut at about 2% 
looks enough. 

7.3 Effect of Correlations 

The previous sample, affected only by the minimum (combinatorial) background, allows 
to study at various levels the effects of correlations. To be more precise, the data sample is 
intrinsically affected by correlations ; a method to study their effects is to remove correlations 
terms in the analysis, i.e. in the algorithm. It was remarked in Appendix B5, that photons close 
in azimuth are strongly correlated. On the other hand, we know that the circle arc populated 
by the Cerenkov photons can be small. Then, one can guess that the net effect of correlations 
should be indeed strong. 

A hrst way to compute the Cerenkov angle and its error, neglecting all correlations, is to 
set : 




di 


(Tf 


— = E — 


a 


R 




(18) 


and knowing Rtme, one can compute the pull rms. In these expressions, af is the squared error 
of di. This corresponds to hxing the charged track direction to its central value given by the 
tracking device. In such a computation one actually neglects correlations in estimating a^, 
since otherwise we would have used (see Reis. (|TT[) and (|B . 6|) ) and not only 

the trace of the inverse of V, which becomes diagonal when correlations are neglected. The 
corresponding pull is always well centered (reflecting the fact that the charged track direction 
measurement is unbiased) while the pull rms varies dramatically with the spreads ae and as 
illustrated by the upper curve in Figs. |^. 

Angles are expressed with respect to the perpendicular to the bar face from which the particle enters in the 
quartz. 
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Another way to proceed is to neglect correlations in the expression for (Rel. O) and 
solve at minimum. In this case, the result provides a £t value Rju practically not biased. Its 
error is computed from the solution at minimum and from the matrix T~^ (see Rel. (|I6D) 
which gives the errors and correlations for a, b, c, from which one can deduce using : 


= 


iR%, 


< [5c + 2a5a + 2h5hf‘ > 


(19) 


and = c + + 6^, i.e. one takes into account the fact that the actual center is not the 

origin, but is closer to its fit value. The corresponding rms pull is the middle curve in Figs. 
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It shows already a much better behaviour than the previous method result which neglected 
all correlations stricto sensu. This full account of the £t center location mainly explains the 
interesting behaviour of the pull rms at large values for ag = a^j,. 

Finally, the lowest curve in Figs. shows the solution taking into account all correlations 
as explained in the sections above and in the appendices. In this case, the pull rms remains 
always close to 1 and departures are never worse than about 10% only|^. 

Comparing the two Figs. |T^, it is clear that the mean effect is much more dramatic if 


correlations due to multiple scattering are harder (at low mean momentum). It is also clear from 
these hgures that there is a small systematic effect (10 % of the pull rms) that our model does 
not account for. The reasons for this systematic effect are several ; hrst, errors and correlations 
due to multiple scattering are treated statistically^^ ; second, at low momentum and/or at large 
angular errors on the incoming charged track direction, non-linear effects become visible. Figs. 
^ show however that these effects remain of limited influence. 

One may wonder why the second method which neglects correlations gives a pull rms which 
improves with large angular errors on the incoming charged track direction, a behaviour quite 
different from the simple uncorrelated mean (hrst of the methods above). This is actually due 
to the peculiar origin of this kind of correlations compared to multiple scattering. In Fig. ^ 
the pull rms is plotted as a function of 1/p {i.e. for increasing angular errors dues to multiple 
scattering) ; crosses correspond to the second method, dots to the third (standard) method. 
It is clear herefrom that neglecting correlations gives a worse and worse result when multiple 
scattering errors increase. 

The different behaviour of correlations due to multiple scattering only (case A) and to errors 
on the incoming track direction only (case B) can be understood to some extent. Let us assume 
that we are in case B ; if we have had an exact knowledge of the actual charged track direction, 
correlation effects would vanish when using it to subtract the center position from the photon 
coordinates. Using the £t center, in place of of the measured center (here the origin), improves 
somehow the approximation made of the actual charged track direction ; this is well rehected in 
Figs. 0 by the difference of slope between the curves with square and cross markers. Instead, 
in case A, there is clearly no longer one and only one actual charged track direction associated 
with the photon set, and consequently, neglecting correlations should degrade the result more 
and more as multiple scattering effects increase ; this is rehected by the behaviour shown in 


Actually, going to much larger values for ag = a^, the lowest curve remains at about 0.9 while the middle 
curve crosses 1 and goes on decreasing. 

^^We take into account only mean effects due to multiple scattering. Actually, the emission time sequence 
of the detected photons is not known and, moreover, the real multiple scattering effect undergone by the track 
before each photon is emitted is also unknown. Therefore, it seems hard to imagine how to go beyond averaging. 
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Fig. I^by the (cross) curve. Real life stays in between cases A and B, since actually we are in 
a mixed case. 

In any case, as the “physical region” for ag and is expected to be around 1 to 2 mr, it 
is clearly preferable to account for all correlations. Depending on the mean track momentum, 
the pull rms departure from 1 may be as large as ~ 50% instead of the ~ 10% mainly due to 
non-linear effects. This problem is reflected by the probability distributions. Focusing on 
the mixed track sample with 0.5 GeV < p < 5 GeV and neglecting only the correlations in the 

(second of the methods just discussed), we get the distributions shown in Fig. 0. Here are 
displayed the probability distributions corresponding to various values for ag = from 0 -Fig. 
ma - to 5 mr -Fig. |T4^ - compared with the case where all correlations are normally accounted 
for (Fig. 0f). corresponding to 5 mr. We see that, already for small angular errors on the 
charged track direction, the probability distribution is badly distorted compared to flatness, 
while accounting normally for correlations gives an acceptably flat probability distribution!^ 
even at 5 mr. One should also notice from Fig pGj f that the systematic effects already noticed, 
which survives our treatment of errors and correlations at erg = cr^ = 5 mr is not hard enough 
that it would spoil the shape of the distribution probability. 

One thus has to notice the dramatic effect of a bad estimate of the center coordinates on 
the rms pull and then on probability distributions. This effect on the pull rms is actually 
due to a wrong estimate of the errors and correlations. Indeed, as noticed in Section ^ when 
discussing Rel. (j^), this effect affects much less the central estimate of the radius and hence, 
of the Gerenkov angle than the error estimation itself. 


7.4 Track—Events with various Background Conditions 

Here we examine samples contaminated by various kinds of background : flatly distributed 
noise on the PMT detection plane, or merging of the track with one or sometimes two (uniden- 
tihed or identihed) tracks which produce additional photons. The chosen momentum range 
is still 0.5 to 5 GeV/c and the population contains the hve possible particle kinds in equal 
numbers. This kind of background conditions can be considered the hardest as these additional 
tracks enter the same DIRG bar, sometimes with directions very close to that of the track under 
ident ideation. 

The pull of the Gerenkov angle for the sample of mixed particles with one identihed addi¬ 
tional track is plotted on Fig. 1^1. This plot gives two important informations : on the one 
hand, the pull bias remains as small as when there is no background (see Fig. ^b) ; on the 
other hand, the pull rms is close to one (typically 1.2), as one expects if the error model is 
correct. A simple gaussian £t to the distribution in Fig. gives a = 0.95 with a very good £t 
probability. All this shows that the tails to the distributions remain limited and do not affect 
the fit quality. 

Figure |T^, a plot of the difference — 9q^, allows to estimate the absolute dispersion of 
this quantity. Gompared with the case with no background (see Fig. j^), one sees clearly that 
the bias is unchanged and limited (0.3 mr), whereas the dispersion is slightly increased (3.6 mr 

The difference in flatness between Fig. (|^) and Fig (jljf) is simply due to different median cuts : In the 
former case it was set at the very loose level of 4cr, while in the later it is 3 (t. One can furthermore compare these 
two figures and remark that a tighter median cut mainly affects the low probability region by depopulating it 
somehow. 
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instead of 3 mr). 

This dispersion depends, among other things, of the track momentnm and dip angle. With 
this respect. Fig. ^ shows the mean theoretical error on the reconstrncted Cerenkov angle 
as a fnnction of this momentnm ; one can check that this error increases noticeably at low 
momentnm (mainly becanse of mnltiple scattering effects). Errors become smaller and reach 
a minimnm platean at high track momentnm becanse of the Cerenkov angle satnration, the 
greater nnmber of Cerenkov photons (proportional to sin^^c), and smaller mnltiple scattering 
effects. 


The fit resnlts obtained when no backgronnd was added to the track were good (see Figs. 
0and|g). They deteriorate only slightly if noise is added : Fig. 0 indicates that the algorithm 
still behaves well nnder backgronnd conditions connected with the presence of other (measnred) 
tracks in the event. In Figs. EJ) and the reconstrncted Cerenkov angle pnlls are displayed 
when one or two other tracks are snperimposed to the signal track (their existence is known to 
the algorithm, which nses this knowledge to reject part of this backgronnd as ambignities, as 
explained in Section |6.2.1|) ; the dispersion and bias of the reconstrncted angle are fonnd close 
to the normal ones. 

For the case of Fignre E3a, where flat backgronnd has been snperimposed to the signal 
track, the algorithm seems to be more affected, since a sizeable bias of 15% appears, the rms 
dispersion increases bnt stays close to 1. This bias still is limited, even if noise conditions are 
qnite severe compared to what is expected with a real detector. 

Fig. 1^ shows the probability distribntion for — 1 degrees of freedom for an eqnal 
mixtnre of single electron, mnon, pion, kaon and proton tracks. It illnstrates the effects of the 
same cnts on different backgronnd conditions (the cnts nsed have been calibrated on events with 
only a flat backgronnd). The plots are shown for several cases of actnal backgronnd conditions: 
a) no backgronnd, b) random flat backgronnd on the detection plane (at the level of 100% of 
the signal photons), c) 1 other track considered as backgronnd (he. no secondary cnt, no 
knowledge of the existence of the second track, see section |6.2.1|) . 

Fig.p!8|a shows that the actnal effect of the cnts on backgronnd free events is mainly con¬ 
centrated in the low probability bins which appear slightly depopulated, while the rest of the 
distribution looks acceptably flat. Figs. p!B[ b and |T^ show that the main effect of background 
photons is to provide a large peak at small probability, and thus the need for a threshold prob¬ 
ability. These last two figures tends also to show that a large flat background is harder to 
account for than the background associated with photons generated by an unmeasured track : 
in the former case, the low probability peak extends up to ~ 15%, while in the later case the 
effect of this peak is negligible above ~ 5%. This kind of plots, which can be produced with 
any data set, are tools allowing to tune the minimum probability above which the fit of the 
Cerenkov ring is considered. 


As a global check. Fig. |l^ shows a plot of the number of degrees of freedom obtained 

after running the procedure on the same sample of single tracks for several noise environments : 
a) without any background, b) with a random flat background, c) with several tracks per event 
considered as background for the original track. One can see that after the cut adjustment. 
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the mean P^r ndof is close to 1. The rms of the x^ ndof distribution behaves also as 
expected, for example in the case (a) the mean is 0.96 and the rms of the distribution (0.32 for 
a mean number of degrees of freedom of 21) is effectively close to (0.31). The number of 

reconstructed events varies between the three plots because of different cut levels adapted to 
different noise conditions. 

7.5 Particle Identification 

The hnal goal of pattern recognition and circle reconstruction is particle identihcation. 
In order to illustrate how the procedure behaves, we present results obtained for one track, 
assuming another track has crossed the same (and single) bar. The angle between the two 
tracks is random between 0° and 50°. 

In Fig. ^we show the reconstruction of generated pions and present (in a)) the case when 
the accompanying track has been reconstructed and the secondary cut on has been applied. 
In (b), we assume this track has not been measured and then one cannot use this further cut. 
In both cases, no threshold probability has been required and the identihcation is attributed 
to the largest probability (which can thus be quite small). The correct identihcation (except 
for e-p-TT degeneracy) is good, above 90% anyway. One can further see that the secondary 
cut on 156^1 produces a large improvement by reducing severely the misidentihcation of pions as 
kaons and protons. Despite the e-p-vr degeneracy, below ~ 950 MeV pion misidentihcation as 
electron looks negligible and, then, electron identihcation below this threshold is possible with 
a good accuracy. Misidentihcation is low below ~ 3.5 GeV/c. 

In Fig. ^c, we have applied both the secondary cut on |56*| and a threshold probability of 
1% which cleans up almost completely the momentum range below ~ 3.5 GeV/c. It is easy 
to see that, strengthening this threshold to 3 %, sharply reduces the misidentihcation which 
becomes signihcant only above ~ 4 GeV/c. 

All this illustrates that the largest difficulties are due to unmeasured tracks or unstructured 
noise. As soon as a limited additional information is available (secondary track directions), the 
quality of the identihcation sharply improves. Setting a threshold probability at a low level 
appears naturally to be a suitable cleaning up tool, as a non-negligible part of the misidentih¬ 
cation, and hence of contamination, is produced by low probability reconstructions. 

7.6 Summary 

As a matter of conclusion, even under severe background conditions, it is possible to remove 
background and select photons associated with a given charged track with a good efficiency 
(still about 90% of the tracks are ht) and with a limited contamination (about 95 % of particles 
identihed as pions are indeed pions and the misidentihcation as kaons is always low, sometimes 
very low). This is achieved by starting with using the subset of unambiguous photons ; indeed 
htting with them the Gerenkov ring gives access to rehned values of the circle parameters which 
are used when reexamining the ambiguous photons. These improved parameter values allow a 
rehned treatment of the photons left aside as ambiguous. 

The procedure described in Section could possibly be improved, but basically it contains 
most of the needed ingredients. Moreover, the ht algorithm presented in Section ^ looks well 
adapted in order to ht circle arcs and to provide x^ distances for fake photon removal. This 
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method allows to use all basic statistical tools (probabilities, likelihoods ...). Of course, we 
know that fake photons cannot be removed at the 100% level, however, we have proved that 
their residual contamination, in severe enough conditions, can be made low enough that the £t 
quality and its statistical meaning are not spoiled. 

8 Cut Handling 

The various cuts described in Section represent finally a set of 8 parameters : two cuts 
on 1501 allowing to dehne the starting sample of unambiguous photons, the median cut level 
for background removal, the ’’perpendicular tracks” cut used to remove trivial ambiguities, 
the two distance cuts in the ambiguous photons recovering step, the minimum number of 
unambiguous photons, and the tails removal cut after the hrst htting step. 

Some of these cuts can be adjusted in order to adapt the algorithm to different noise and 
background conditions, some are nearly hxed by the detector characteristics (|50| < 30 mr, 
perpendicular track cut), or by algebra (the minimum number of unambiguous photons is set 
at the smallest admissible value). Also, one can verify that the last two cuts only influence 
marginally the reconstruction procedure performance. 

The 3 cuts have a well dehned statistical meaning and the procedure outputs usual x^ 
probabilities. Simple criteria can be used in order to adjust these cuts, in such a way that 
they hlter the background without spoiling too much the signal. Indeed, too stringent cut 
values may bias dramatically the fit quantities or spoil the probability density function of the 
£t results. Too loose cuts may instead result in very low reconstruction efficiency and poor £t 
quality. As, in the DIRC problem, errors are close to gaussians, 3a in tuning these cuts is a 
magic value of well dehned meaning. When having to cut below this level one has to check for 
potentially harmful effects (biases or unacceptable changes in the probability density function 
of the £t results). 

8.1 Tuning of the Adjustable Cuts 

In fact, only the hrst three parameters in the preceding list can be considered as adjustable 
by the user to adapt the algorithm to diherent noise and background conditions. 

As illustrated above, the main tool in tuning the various cut levels is the probability distri¬ 
bution of reconstructed rings. From the most common experience, one knows that if errors and 
correlations are well understood, these distributions are hat between 0 and 1; one can always 
assume this has been checked on clean samples. Clean samples can indeed be constructed using 
a Monte Carlo. Tagged samples of identihed (by other means) particles, or low multiplicity 
samples, can always be extracted from data. 

Then, once the reliability of the error/correlation handling is ensured, any departure from 
hatness has to be attributed to background. The set of cuts dehned above is devoted to the 
removal of the various kinds of background photons. They are tuned by asking the particular 
value of the cut to optimize the hatness of the probability distribution. Ideally, when all cuts 
are at optimum values, the probability distribution has mean value 0.5 and rms 0.289. 

In real life however, one knows that an actually hat distribution always exhibits a peak 
at low probability, rehecting the fact that there always exist conhgurations where events are 
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improperly reconstructed. This situation has been already met here (see Figs. [T8|) . Therefore 
the above rules have to be slightly modihed. Flatness has to be requested above some threshold 
value a, which can be relatively large for the purpose of tuning (a = 10%, 20% or more are as 
good). One can always compute the mean value and rms of the probability distribution above 
this threshold as a function of the cut value and £t it (or compare) to respectively (1 — a)/2 
and (1 - a)/yT2. 

Fig. which deals with a sample of single tracks with no additional background, illustrates 
how this tuning can be performed for the median cut. When hxing the cut level at la, we 
are clearly cutting too tight and then the probability distribution exhibits a huge peak at 1. 
Releasing this cut allows to recover a non-biasing behavior at a cut level of about 3 to 4 a as 
one can expect : the mean probability is found close to 0.5 and its rms close to 1/a/T2. One 
can rehne the tuning in an obvious manner. 

One may notice that the number of entries in the histograms is only marginally dependent 
on this cut level : the algorithm efficiency is not directly affected by it (but the hnal errors on 
the fitted quantities are changing with this cut - roughly like - since the total number of 
unambiguous photons entering the £t is cut dependent, even if the recovering procedure hnally 
limits its variation). 

On the other hand, the low probability peak, as said above, reflects the existence of im¬ 
properly reconstructed rings. This is partly due to conhgurations where the errors are not well 
estimated^ and partly due to the level of background which survives the cuts. Usually, such a 
peak is made harmless by setting a threshold probability. 

The two cut play an important role in the magnitude of this peak. The primary \S6\ 
cut, if it happens to be too loose, accepts more easily background photons in the unambigu¬ 
ous photon subsamples. These background photons will in turn degrade the £t quality and 
contribute to increase the peak at zero probability. However, it cannot be set too tight (for 
instance < 5 mr) since in this case, the number of unambiguous photons primarily associated 
with tracks may decrease too much (below 2), and consequently the number of tracks lost will 
increase without a significant improvement at the zero probability peak. 

The secondary |56*| cut acts likewise as can be seen from Fig. ^ Its strong power of 
rejecting the inter-track noise is illustrated by the vanishing low probability peak in quite hard 
background conditions. 

These examples illustrate how a given cut level can be tuned on any sample of tracks ; here 
again the procedure applies to simulation and real data as well. The cut levels are tuned in 
such a way that the probability distribution has its expected flat shape. This tuning allows to 
recover at the same time a mean probability of ~ 0.5 and a spread of ~ 1 /a/T 2 rms. After 
discarding (eventually) the low probability bins, it is easy to compute the efficiency of any 
further cuts in probability, because of the flatness of the remaining region. 

8.2 Effect of non tunable cuts 

The non tunable cuts, briefly considered before, are the perpendicular tracks recovering cut 
and the minimum number of unambiguous photons cut. The former is tunable only in the sense 

the DIRC problem, the final errors are computed by differentiating functions. They are geometry 
dependent and sometime close to singular points of these functions. 



that it should be adjusted on data or on Monte Carlo simulation not to spoil the algorithm 
acceptance (eventually creating biases if the cut is too wide), and not being rendered inefficient 
by a too stringent width (in this case, only a small fraction of the almost perpendicular tracks 
is concerned by the cut). Anyway, as stated before, this cut has only limited effects : its 
activation or deactivation is hardly noticeable on the studied data. 

The second non adjustable cut has a rather strong influence on the global algorithm accep¬ 
tance. Figure ^ shows the distribution of the number of unambiguous photons at the end of 
the procedure : it is clear that if the cut level is increased starting from the normal value of 2, 
the number of non accepted tracks will be strongly affected, since in this region the distribu¬ 
tion is showing a strong dependence of the number of tracks upon the number of unambiguous 
photons. Normally, there is no point in increasing the cut level above 2 or 3 unambiguous 
photons. 


To conclude this section, we can say that the choice of the cut level for both cuts is not 
really free, but constrained by considerations depending strongly on the use of the algorithm 
and the type of data. 

9 Conclusions 

We have studied a procedure able to perform pattern recognition among photons in order 
to reconstruct the Cerenkov angle associated with the charged track emitting them. The 
procedure, which has been developed for the case of the DIRC, may apply mutatis mutandis 
to data from other ring imaging devices. The basic requirement was that the procedure should 
provide a good approximation of a value in order that a probability can be acceptably 
dehned, even in presence of a huge background mixed to the signal photons. We have shown 
that such a procedure can indeed be constructed and proved to work satisfactorily from the 
kaon Cerenkov threshold up to 4-h5 GeV/c. 

We advocated the use of the stereographic projection which guarantees that the hgure to be 
£t is always a circle, whatever systematic errors, misalignment problems, etc ... are. Moreover, 
we have shown that such kind of errors affects negligibly the estimate of the central value for 
the radius ( i.e. the Cerenkov angle). 

The basic tool of the procedure is a £t corresponding to minimising a expression, lin¬ 
earised as commonly done. In the case of the DIRC, this x^ has to be modihed in order to 
take into account that the photon do not populate the full circle, but rather a relatively small 
fraction of it. The modihcation implemented turns out to take into account in the £t procedure 
the existence of a measurement of the charged track direction, beside the photon directions. 
In this way, systematic errors in the circle reconstruction can be avoided almost completely. 
We have also widely illustrated the role of correlations and methods to estimate them. The 
£t procedure returns a value of the circle radius (connected with the Cerenkov angle) and an 
improved information on the charged track direction. This last information has been shown to 
be crucial in cases where the charged track direction is poorly known, either intrinsically (from 
the tracking device in front of the DIRC), or because of large multiple scattering effects. 

We have shown, that using this tool, it is possible to dehne an algorithm able to solve 
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ambiguities and remove efficiently background photons. It relies on an iterative procedure, 
based on an intensive use of distances. It starts with unambiguous photons and recovers 
additional photons in a second pass. Such a procedure depends on cuts, and it has been 
shown that one can check the effects of these cuts and tune their levels on pull and probability 
distributions. This allows also to dehne a calibration procedure which can be worked out with 
real data in a simple way and optimized in real background conditions. 

With this respect we have shown that the ’’combinatorial” background generated by ambi¬ 
guities can be easily overcome. We have also shown that the background produced for a given 
track by photons associated with other reconstructed tracks was easy to deal with. The hardest 
background is provided by non-reconstructed tracks or flat background of various origins ; we 
have shown that it is possible to deal reliably with them too, by using tighter cut levels. 

As a hnal result, we have shown that the reconstruction and particle identihcation are 
possible through a £t of the Cerenkov angle and particle direction, with a remarkable efficiency 
(above 90 %). We have also shown that the fit probability is correctly estimated and has the 
expected flat distribution. This means that the cleaning up part of the procedure is able to 
discriminate efficiently between hits, even when there is a large background together with the 
signal. The residual background contamination level has been shown to be harmless under 
realistic conditions. Therefore one can use standard statistical tools in order to calibrate cuts 
and check the reconstruction quality. 
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Appendix A : Errors and Correlations in a DIRC Device 

A1 Error Functions of the Charged Track Direction 

The charged track direction is affected by two different kinds of errors. The first kind are 
the measurement errors on the track at the entrance into the radiator (the quartz bar), the 
second is the error produced by its multiple scattering inside the radiator, which makes that, 
for each emitted photon, the Cerenkov angle 9c is relative each time to a slightly modified track 
direction. Let us denote the incoming track direction by ^ = (sin 9 cos 0, sin 9 sin 0, cos 9) ; the 
condition = 1 implies that (f ■ = 0 , and then that the error vector 5q is perpendicular to 
the track direction q. Let us write it : 


5q = 5iq + 52 q (A . 1 ) 

where 5iq refers to the measurement errors and 52 q to the multiple scattering. The error 
functions on the track parameters ( 6 *, 0 ) being denoted {69,64)), we have : 


6 iq = 69 V + sin 964> w (A . 2 ) 

where v = dq/d9 and w = [ 1 /sm.9\dq/84) are unit vectors orthogonal to each other and to q. 
We may have < 6964) > 7 ^ 0. After a path of length u inside the quartz, we also have : 



62 q = [ei{u)v + e 2 {u)w\ 


(A. 3) 


where c = 13.6 10 ^/0p[GeV], [3 is the particle speed and Xq is the quartz radiation 


length. The quantitites £*(«) are gaussian random variables such that < >= 1 and 

< ei{u)e 2 {u) >= 0. Here and throughout the paper we neglect all departures from gaussian 
distributions |^. From Rel. ( |A ■ 3|) , we get : 


< [ 62 q\^ >= (A . 4) 

However, as we don’t know where the photon has been emitted, the best estimate of 
< [ 62 ^“^ > for this variance is its mean value over the path followed, assigning an equal proba¬ 
bility to each possible emission point[^ : 

<< [^2^^ »u= Y [ 2c^^du = (A . 5) 

h Jo Xq Xq 

where L is the total path length of the charged particle inside the quartz. 

Let us assume that two photons are emitted after respectively paths Ui and U 2 inside the 
quartz. As the photon detection does not reveal where it has been emitted, we can have with 
equal probabilities Ui > U 2 or U 2 > Ui. Therefore, up to higher order corrections, we have : 

^®The notation here is obvious : the inner < • • • > denotes the statistical mean {i.e. the expectation value) 
already defined in the body of the text, while the outer< • • • >xy- - denotes the additional average performed 
over continuous variables x,y, - ■ 
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if U 2 > Ui 

< 

if Ml > U 2 


52q{ui) = 

52q{u2) = 
52q{u2) = 

S2qiui) = 


cj^ [ei{ui)v + e 2 {ui)w] 
V -^0 


S 2 q{ui) + c 


U2 — Ml 



Xo 

52 q{u 2 ) + c 


V ^0 
[£ i ( m 2 )m + e2{u2)w 


[e3{u2)v + e^{u2)w] 


Ml — M2 


Xn 


[£3(mi)m + £4(mi)m;] 


(A. 6) 


where the various e’s carry unit variance and are statistically independent when they carry 
different indices and/or different arguments (he. < ei{uj)ek{ui) >= 0 only ii i ^ k and/or 
3 7^ 0 - 

One can check that 52 q{u 2 ) and 52 q{u\) have the same variance given by Rel. (|A . 4) (or by 
(I A ■ 5| l) and it is shared equally between their v and w components (remind that v ■ w = 0). 
Moreover, we can now compute the expectation value : 


< 52q{ui) ■ S 2 q{u 2 ) >=< [52q{ui)]‘^ > 0(m2 - Mi)+ < [S 2 q{u 2 )]‘^ > 0(mi - M 2 ) (A . 7) 


where 0 is the standard step function and the expectations values on the RHS are given by 
Rel. (|A . 4|) with the appropriate argument. It is useful to have an estimate of this correlation 
coefficient ; it is obtained by averaging Rel. (|A . 7| ) upon mi and M 2 . This is easily achieved : 


pL pL 2 

<< 62 q{ui) ■ S 2 q{u 2 ) »uiU 2 = t? / / duidu 2 < S 2 q{ui) ■ 62 q{u 2 ) >= -c 

Jo Jo o 

Therefore, the average correlation amounts to 2/3 of the average variance, 
fraction carried by each component are : 


V 


The covariance 


<< [ 52 qiui) ■v\[S 2 q{u 2 ) ■ v] » U 

< 

« [hqiui] ■ w\[ 52 q{u 2 ) ■ w] » 
while all other covariance mean values are zero. 


1 . L 

3 "" ^ 

1 . L 

3 "" 


(A. 9) 


A2 Finite Size Sample Corrections to Multiple Scattering Errors 

It is clear that the length of the path followed before the track emits any photon is inacces¬ 
sible, nor the ordered time (or path) sequence of the detected photons. This implies that one 
has to work with averaged quantities. In the previous subsection, averaging is defined by Reis. 
(|A . 5|) and ( A ■ 8|) , for respectively the variance and covariance terms. Averaging by integrals 
assumes the emission of an inhnite number of photons along the charged track path inside the 
radiator. 

Here we present another method, based on a finite number of radiated photons ; it allows 
to find the 1/n corrections to the above method, while checking it conceptually. 
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Let us assume n detected photons are emitted along the path of length L. Photon acceptance 
is mainly connected with their azimuth on the Cerenkov cone ; therefore we can assume for 
simplicity, that these photons are emitted after equal paths of length L/n. Let us also assume 
we work with each coordinate of the circle center in the equatorial plane (hnal results for 
variances have to be multiplied by 2 for comparison with the preceding subsection). We denote 
by Xi the coordinate of the charged track direction (actually its fluctuation around xq, the true 
track coordinate]^ at the radiator entrance) when it emits the photon in its ordered time 
sequence. Then we have : 


' xi= xo + ei 
X2= Xo + ei + €2 
^3 = Xq + ei + €2 + 63 


(A . 10) 


where the functions are centered independent random variables : < >= 0 and < e^e/ >= 

= c^L/(nXo), c being already dehned). Let us also dehne A = na"^, a quantity 
independent of n, which coincides with the standard of Review of Particle Properties 
16|. For simplicity, we choose from now on xq = 0. Trivially, writing < >= 0 for all fc, does 


not mean it is true for any given track, but that this is fulhlled by the mean values computed 
from a large sample of tracks. Indeed, one set of x* corresponds to one track and then to one 
sampling of the e^’s. 

The ordered time sequence is unknown ; nevertheless, we can dehne the multiple scattering 
variance of the sample by : 




1 

n 


E 


X 


2 

k 


k 


Using Reis. (|A . 10|) , it is easy to hnd the expectation value of Rn 


(A . 11) 


n + 1A /. N 

< Rm >=-- A . 12 

n 2 

which tends to half the value in Rel. ( |A . 5| ), when n 00 , as expected. Here we have to split 
up the result into two parts (correlated and uncorrelated). The correlated part is the variance 
of the center of gravity : 


G = -J2xk 


n 


(A . 13) 


It is easy to compute it and get : 


< G" >= 


{n + l){2n + l)A 

2 ^R ^ 


(A . 14) 


^^Obviously, the true coordinate at the DIRC entrance is approximated by the mean value provided by the 
tracking device in front of the DIRC, but does not coincide with it. 
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which tends to 2/3 of < Vm > when n —>■ cx), in agreement with Reh 
uncorrelated part (reduced variance of the sample) is the mean value of : 

W^ = -Y.{xu-Gf (A. 15) 

which is : 

1 A 

< >= (A . 16) 

0 

and tends to 1/3 of the variance when n —> oo. Therefore, the integral averaging presented 
in the previous subsection gives indeed the large n behaviour of the mean multiple scattering 
effects. The subleading terms are of the order 1/n and small ; they can easily be read off the 
results in this subsection. They show that the sharing uncorrelated-correlated parts of the 
variance departs from 1/3 : 2/3 by terms of the order l/(3u). They may become important 
only for number of photons of the order u < 4 -P 5. 

A3 Handling of Other Errors 

There are qualitatively two kinds of errors we have to deal with in the DIRC problem. The 
first kind are mostly geometric errors due to the reconstruction of the photon direction : finite 
size of the photomultipliers, finite size of the bar entrance window. There is no difficulty of 
principle in their reconstruction and propagation from the water tank and photomultipliers 
back into the quartz bar and we will not comment on this any longer. 

The second kind of errors (chromaticity) is due to the dependence of the refraction indices 
(water and quartz as far as the DIRC is concerned) upon the photon wavelength. This is due 
to the fact that photons emitted by Cerenkov effect do not have a definite wavelength, which 
rather runs over a relatively wide spectrum. 

When refracting back the photon direction from water (index nw[X)) to quartz (index 
? 7 ,q(A)), it is appropriate to consider as basic random variable the quartz index ; then, the ratio 
g of these indices can be expressed as a function of uq . In the BaBar setup, this ratio can be 
considered constant over the range of wavelengths to which the photomultipliers are sensitive. 

The quartz index is surely an appropriate variable because, thanks to Reh (i. it affects 
directly the expected Cerenkov angle 6 c and its error for each photon. This is an important 
source of errors (it turns out to be equivalent to treating the index as a random variable with 
a standard deviation corresponding to 5n/n ~ 0.6%). This error can be either attributed to 
the charged track direction or to each photon direction separately, but, in the former case, one 
has to take into account the fact that these photon errors are uncorrelated (this gives rise to 
corrections in the error magnitude once applied globally to the track direction ). In the course 
of the fit procedure, the which allows to reconstruct the circle radius takes naturally this 
effect into account. Moreover, when fixing the Cerenkov angles to the values expected from 
the charged track momentum, assuming the possible mass assignments (the consistency check 
referred to in the body of the text), it allows to take into account automatically the spread due 
to the photon wavelength spectrum. 


A . 8). The mean 
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Appendix B : Basics for a Circle Arc Reconstruction Al¬ 
gorithm 

As noted in the main text, when htting a circle having at hand points (subject to mea¬ 
surement errors) spread out onto a relatively small arc (typically 60°), and if the error on each 
point is non-negligible compared to the radius of the circle^, the £t quality becomes poor. 

Indeed, one can attempt a circle fit using standard algorithms [|^ [^, |^. However, fitting 
the 3 circle parameters under the typical conditions sketched above, using only the points on the 
circle leads to unexpected bad results. The center coordinate in the direction associated with 
the two outmost “experimental” points is relatively good ; however, the center coordinate in 
the direction toward the arc is signihcantly and systematically displaced towards the arc (com¬ 
pared with its expected value) and consequently the £t radius is systematically underestimated, 
sometimes badly. 

One way to circumvent this problem is to introduce additional information about the circle 
center (the charged track direction as measured by another device than the DIRC, for instance 
a drift chamber). In this case, the £t algorithm works much better (as illustrated in this paper) 
in the sense that the pull is found with the expected unit standard deviation and a negligible 
residual methodological bias[^ 

B1 A Naive Estimate of the Error Matrix of the Charged Track 

The vector 5q dehning the error on the charged track direction q is given by Reis. ( [A .| 
ly (|A . 2|) and ( |A . 3|) . It is perpendicular to the charged track direction, and it projects out 
onto the equatorial plane (through the stereographic projection) as 5q/2. Denoting a and b the 
coordinates of the circle center in the equatorial plane in the directions parallel resp. to v and 
w (both orthogonal to we can define the error functions on the circle center by : 


1 ) 


where u is the path length followed by the charged track inside the radiator. No average value 
over the path length has to be performed in this naive approach. Doing this way, the error 
matrix S for the charged track can be computed by taking the expectation value of appropriate 
second order terms. One can then choose, as a rule of thumb, to approximate the error functions 
above by considering the standard deviations ai u = L/2, i.e. at half the full path of the charged 
track inside the radiator ; in this case, we have : 

With this respect a ratio of 2%, typical for the BaBaR DIRC, is large when points are on a circle arc of 
about 60°. For a track fit in a drift chamber, 60° is relatively large as the ratio of error to radius is here of 
about 10“^. 

^^Practically, the residual methodological bias is of the order 10“'* of the radius {i.e. about 0.1 to 0.2 mr for 
the Cerenkov angle) and then it is overwhelmed by the bias originating from fake photons, which cannot be 
completely removed by any realistic procedure. 


56 = ^ sin eScj) + JJe 2 (M) 


(B 
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rough estimates : < 


(B.2) 


< [Sa]‘^ >= 

< [ 56]2 >= 

< 6 a6b >= 


1 

4 

1 

4 


< m‘ > 


sin^ 6 < [50]^ > +2*^^ 


- sin 0 < 595(1) > 



where the quantities < [56]‘^ >, < [5(j)]‘^ > and < 565(j) > are the elements of the error matrix 
So provided by the reconstruction procedure from the tracking device in front of the DIRC bar. 
The question now is to get a motivated estimate of the full error matrix S of the charged track 
direction by taking into account theoretical a priori information on multiple scattering. 


B2 First Approach to the Charged Track Error Problem 

Let us consider only the a coordinate of the circle center, or restrict our problem to a one 
dimensionnal aspect. Solving the 2-dimensional case (a, b) will follow straightforwardly. 

Actually, what is of relevance for our problem, is the mean value and error on the directions 
of charged tracks associated with the emitted and detected photons. There is some difference 
with the errors at half path inside the radiator, as will be shown below. 

For a charged track entering the radiator medium, the tracking device provides a measure¬ 
ment of the direction with its error matrix (Sq referred to above). Let us associate with this 
measurement the origin in the equatorial plane where all directions are projected out. 

When the charged track emits photon i, it has a given (even if unknown exactly) direction ; 
this direction varies from photon to photon simply because of the multiple scattering the charged 
track undergoes. The intersection of this direction with the unit sphere has for image in the 
equatorial plane the point of coordinate Oj along the direction v (see Section Al) ; what is of 
relevance for the Cerenkov angle estimate is clearly the mean value and standard deviation of 
the set {oi ; i = 1, - ■ ■ n}. 

On the other hand, even in perfect cases (no multiple scattering) the actual direction of 
a given track is obviously not the mean value (dehned here as the central value given by the 
drift chamber reconstruction). Fitting the Cerenkov cone allows possibly to improve this last 
estimate using a (large) number of additional informations (photons), going thus closer to its 
actual value. The effect of multiple scattering is that the actual center seen from each photon 
is different and randomly distributed ; then we cannot access one actual circle center, but only 
dehne a mean value of the set of actual centers. 

Moreover, if photons can allow for improving the average estimate of the charged track 
direction, errors can be estimated from the (underlying) parent random distribution. 

Therefore, we can write : 


tti = Oq + 5ai , i = 1, ■ ■ • n 


(B.3) 


where Oq denotes here the expectation value of the center of gravity of the {oj} set^ and where 
the random error functions 5ai given by Rel. (|B . 1|) with different paths Uj, are unbiased (< 


20 


Actually, this turns to define the Oi’s by relations like Reis. (A . 10), with an additional offset ag in place 


of Xq- 
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6ai >= 0). In connection with what said just above, it should be stressed that the expectation 
value Oo for all measured photons associated with one track is not necessarily zero, but the 
set of all {oo} (each associated with a given track) is surely distributed around zero, with 
known deviations. Then, what is of relevance in our problem is the value of oq and its standard 
deviation on a track by track basis. 

In order to get the mean value and the error of the direction set {a*}, one can minimize the 
function (we dehne a vector g of components Qi = 1, Vi = 1, • • - n in order to match indices) : 


F(a) = {agi — ai)V^j ^{agj — aj) , with Vtj =< doiSaj > (B . 4) 

where summation over repeated photon indices is understood. The zero of dF{a)/da gives this 
minimum (using also Expression ( [B . 3|) ) : 


_ 9i^ij 

aiSiVij' 


(B.5) 


This expression gives the usual result for the estimate a from a set of measurements {oj} 
in the least squares approach. Its expectation value|^ < a >= Oq is unbiased and its error 
function 5 a, which can be read off Rel. (|B . 5|), allows to compute its standard deviation a a 
{al =< >) : 


4 = didiVij ^ (B . 6) 

When there is no correlation (Vjj ~ 5ij), this expression gives the usual result (a”^ = 

In general 5ai and 5aj are given by expressions like in Reis. (|B . 1|) with two different path 
lengths Ui and Uj, both unknown. Correspondingly the quantity Vij =< 5ai{ui)5aj{uj) > can 
only be estimated by performing the average as presented in Section Al of Appendix A. This 
gives : 


5ai(yVjj^5aj(yVjj^ ^^uiUj 


BE + —AI 
24 


j ij 


(B.7) 


where E is a rank 1 matrix such that each Eij = 1 ; here B = [< [59^ > +l/3A]/4 and 
A = (?L/Xq. In order to compute using Rel. (|B . 6|) , we need to invert V just dehned. 
Indeed, using = nE, it is easy to prove that : 


V = \I + fiE = hl - ^E] (B . 8) 

n/i + A ^ ^ ^ 

where n = dim I = dim E is also the number of points (photons). Using this formula with the 
matrix in Rel. ( [B . 7|) , we easily hnd : 

= j[< ['591">+U + W] (B,9) 

all ai have been fixed at zero (the measured central value) for the track considered, this corresponds to 
take as sampling 5ai = —aogi. 
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This relation could be expected beforehand. It shows that the uncorrelated part of the 
error affecting each (2/3 of the variance associated with multiple scattering as shown by Reh 
(|A ■ 8|) , plus the variance provided by the device in front of the DIRC bar) is transfered to a 
without changes, while the correlated part (1/3 of the multiple scattering contribution to the 
full variance) scales with n. If we had neglected the terms generated by the multiple scattering 
effects in Vij (for i ^ j), we would have got instead = [< [69]“^ > +A/(2n)]/4 which can be 
considerably smaller at low track momentum. 

Actually, the finite size sample corrections (see Section A2) may be accounted for. Con¬ 
cerning the uncorrelated part, this would amount to l/u^ corrections and can be neglected ; 
the correlated part gives however a 1/n contribution which corrects the term A/ 6 n in Rel. (|^ 
. 9) by a factor of 4. Therefore, an improved expression for taking into account all 1/n 
corrections is : 


<^2 U = jl< [SOf > +1.4 + Aa] (B . 10) 

It is also interesting to compare Reis. (|B . 9|) and (P . 10|) with the hrst Rel. (|B . 2|) . 
Indeed, one clearly sees that the variance for a is smaller than the variance at mid path inside 
the quartz as soon as the number of photons is greater than 1 (large n limit) or 4 (finite n 
corrected). Then, in all practical applications, Reis. ( |B . 2|) do not reflect the sharing of the 
variance between correlated and uncorrelated parts and leads to an overestimate of the multiple 
scattering contribution to the center errors. 

B3 The Full Charged Track Error Problem 

We have just treated (as a one-dimensional problem) the determination of the error on the 
the a coordinate of the circle center, taking into account the number n of emitted photons. We 
have seen that the uncorrelated part of its variance decreases as 1/n while the correlated part is 
unaffected. However, our actual problem is two-dimensional and because of correlations terms 
like < 6 ai 6 bj >, it is not equivalent to the conjunction of two one-dimensional problems. 

In order to complete the treatment, let us display the following identity for the inverse V~^ 
of a symmetric matrix V of rank and dimension n : 


[ Si c \ ^ ({Si-cs^^c)-^ -{Si-cs^^cy^cs^^ 

^ ^ ^ 

\C S 2 ) V -^2“^C(Ri - CS^^C)-^ 82 ^ + S 2 ^C{Si - CS 2 ^C)-^CS 2 ^ 

(B . 11) 

where the submatrices Si and S 2 are square matrices of dimensions resp. k (< n) and n — k, 
while C and C can be square or rectangular. This identity allows to invert easily a large and 
structurally complicated matrix, when submatrices have peculiar forms, easy to invert (like 
Rel. (P . 8|) , for instance). 

Let us apply this relation to our problem. We have, by construction. Si ij =< 6 ai 6 aj >, 
S 2 ij =< Sbi 6 bj > and Cij =< 6 ai 6 bj >, where the error functions are given by expressions like 
Rel. (p . 1|) with appropriate path lengths. It is easy to compute these (sub-)matrices : 
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(B . 12) 


' Si = BsE +I 
= £ + ^21/ 

C = Borf, E 


, Be = \[<[5ef>+\A] 

, B^ = ^[sin^ e < [5(t)f > +^24] 
, Be^ = -sin6' < [505(f)] > 


where the rank 1 matrix E has been already dehned and still A = (?L/Xq. We clearly have 
C = C and all snbmatrices here are n x n. Rel. ([B . 11|) is useful in our case because these 
three submatrices have each a special form. 

We can now dehne two sets of relations analogous to ([B . 3|) for the a* and bt introducing 
this way Oq and Bq, and the vector of dimension 2n : (• • •, agi — ai 


Mj 


^31 


Then we 


can dehne a function = F(a, h) in a way analogous to Rel. (|B . 3|) , using this vector and the 
matrix of Rel. ( |B . 11|) . Doing as previously, it is easy to hnd that the solution which minimizes 
F(a, h) is a couple of random variables (a, h) of expectation values (oq, &o) (to be ht) with an 
inverse of covariance matrix dehned by the sum of the elements of each of the four submatrices 
in Rel. ( |B . 11| ). These sums can be easily computed knowing the submatrices Si, S 2 and C 
(Reis. ([B . 12|) ), by means of Reis. ( |B . 11|) and (|B . 8|) . 

After tedious algebra, it follows from there that the center hxing term can be written : 


Xc ~ (®05 ^ 0 ) 




- sin 6* < 595(f> > 


sin^ < 565(f) > 


^[sin^ 0 < [5(t)f > + ^A] 


( do 
V bo 


(B . 13) 

In writing this expression, we have used oq and bo instead of oq — dmeasured and ho — bmeasured, 
taking into account that the corresponding measured values are zero by dehnition. This relation 
dehnes the error covariance matrix of the charged track direction associated with n detected 
photons ; it dihers from the matrix Sq by its taking into account the multiple scattering 
undergone by the charged track inside the radiator. It can simply be written S = Sq + [A/3 + 


A/(6n)]/. 

One can apply the hnite size sample corrections found in Section A2, as we did in the 
previous subsection. This amounts to change the expression for S to S = Eo+ [A/3 + 2A/(3n)] J. 

Finally, it should be noted that the ao&o covariance term is not affected by effects due 
to multiple scattering ; this could have been inferred from Reis. (P . 1|) (and explains the 
covariance term in Reis, (p . 2|) , as the expectation value < ei{u)e 2 {v) > is zero for any values 
of u and n). From now on, we name for clarity the parameters to be £t a and b instead of Oq 


and bo¬ 
lt should be noted, however, that multiple scattering effects imply that there are as many 
centers (charged track directions) as photons. Therefore, any £t procedure can only provide a 
determination of the mean center coordinates for each track considered, as an approximation 
of the actual center value at the DIRC bar entrance. 
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B4 The Minimization Function and the Circle Parameters 

Given a set of points known each with some error, and assuming they should be on a circle 
arc, the problem we state here is to dehne a function F{a, b, R), the minimum of which providing 
the circle parameters. An usual approach is actually to choose as function F, a Denoting 
by R the circle radius and by (a, b) the center coordinates, the function is : 


i,j=l,n 

where R = {xi — a)^ + {yi — by is the distance of each point {xi, yi) to the £t center, and 
Qi = 1 defines a constant vector g introduced only in order to have a correct matching of 
repeated indices. A priori, the covariance matrix is defined by Vij =< 6 di6dj >. Usually, the 
error functions Sdi are obtained by differentiating the expression for d* : 


6di 


{xj - a)6xi + {yi - b)5yi 
di 


(B . 15) 


where 5xi and 5yi are the errors functions affecting the photon measurements Xi and yi. 

In our case, the points are spread onto a small arc and/or the relative size of the errors 
compared with the circle radius is large ; then, one has to introduce additional information for 
reasons already quoted at several places in the body of this paper. The most obvious additional 
information which is available in our case refers to the circle center. 

We have as a priori information the measurement provided by the tracking device located 
in front of the DIRC bar entrance ; this is summarized by a central value and a covariance error 
matrix (referred to anywhere above as Sq). Obviously, this defines a distribution (normal, 
assuming we are lucky) but not the actual location of the center indeed associated with the 
track under consideration. 

The question is now : how to introduce the approximate knowledge (0, 0) of the given 
charged track direction and keep anyway its actual center coordinates (a, b) to be fit? In the 
previous case (no charged track information), the measured quantities could be written : 


Xi = x'l + a + 5xi , yi = y'^i+b + dyi (B . 16) 

where x/ = R cos (pi and y^ = Rsinpi. Here R is the true radius, pi is the true azimuth on the 
circle and (a, b) the true center. Then the true value for the x* and yi are respectively x/ + a 
and y^ + b. Fixing the center at the “measured” value {ai,bi) (actually (0,0)), turns out to 
rewrite these equations : 


Xi = x^ + tti + Sxi , yi = + bi + Syi (B . 17) 

However, for a given well defined track, we can write : 

tti = a + Stti , bi = b + Sbi , Vi = 1, • ■ ■ n (B . 18) 

where 6ai and 6bi are the error functions which take into account the errors at the entrance of 
the DIRC bar and the multiple scattering undergone by the charged track up to the point where 
it emits photon i. In this way (a, b) is the actual center when it exists (no multiple scattering), 
otherwise it can be formally defined as the mean value of the quantity corresponding to G in 
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Rel. (|A ■ 13 ), which is then non-zero on a track by track basis. Then Eqs. 
rewritten : 


17) can be 


Xi = + a + 5ai + 6xi , yi = Vi +b + 6bi + 6yi 


(B . 19) 


The difference between the case when the center is left free and when it is constrained is 
transfered to the error functions which become Sai + 6xi and 6bi + 6yi instead of respectively 
6xi and 6yi. Conceptually, the difference comes from what is submitted to fit in both cases. 
In the former case (free center), the measured quantitites submitted to fit are the measured 
points {xi,yi), while in the latter case (constrained center), the quantitites submitted to fit are 
actually {xi - ameasurement, yi “ bmeasurement)■ In the former case, the center (a, b) is fully fit, in 
the latter case one fits the departure of the actual center from the measured point (0, 0). 

Then, Rel. (|B . 15|) is still valid but should be rewritten : 


5di = 


[xj - a){Sxi + Saj) + {yj - b){6yi + Sbj) 
di 


(B . 20) 


in order to keep 6xi and 6yi their original meaning (errors due to the measurement of the 
photon direction without reference to the charged track). If, moreover, we choose the origin 
in the plane in order that it coincides with the image of the track direction provided by the 
tracking device, Rel. (|B . 20|) can be approximated by : 


5di = 


Xi{Sxi + Sat) + yi{6yi + 6bi) 

d' 


(B . 21) 


where d' in the denominator is d' = \jx‘f + yf. Sdi in Reis. ( [B . 20|) and ([B . 15|) differ only at 
first order and only by the differentials for da* and ddj. 

Therefore, the quantity Xn (Rel. (^ 
errors functions are given by Rel. (p~ 


1^ ) can be used with Vij =< 6di6dj >, where the 
'^ ). As they depend on the path length followed up 


to the emission of each photon, this expression has to be approximated by its mean value 
=<< ddjddj >>uiUj easily computable using all information given above. 

In this way, we are in position to define xi ^ function of (a, b) and R. It remains to 
account for forcing the coordinates (a, b) to remain in the neighborhood of the measured point 
(0,0) ; this is achieved]^ by defining the following : 


2 2,2 
X =Xn + Xc 


(B . 22) 

using Reis. (|B . 14|) and (|B . 13|) , where the influence of multiple scattering is already taken 


into account, including correlations. While is n — 3 degrees of freedom, the x^ just defined 
is n — 1 degrees of freedom. 

One could ask oneself about the influence of the additional term Xc when minimising, if 
the circle arc happens to be large enough that such a term is actually useless. We have checked 
numerically this case using trivial simulations, where the populated arc length and the number 
of ’’measured” points could be varied at will. We have found that for large circle arcs (about 


^^One may ask oneself whether Rel. ( B . 22 ) actually exhausts the problem. Indeed, one may be tempted 
to introduce in the to be minimized, terms coupling a and b with the di — Rgi ; this is not studied here. 
Anyway, such additional terms would surely degrade the algorithm speed. From the results already at hand, 
one may conclude that their effect is small for track momenta above 500 MeV/c. 
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180° or more) the additional term xh prevent to get the same solution as when it is 

removed, with completely negligible fluctuations. 

Another possibility could be considered. This turns to give up fitting the center, accepting 
the measured value (0,0) as optimum. Using the notation d[ = di{a = 0, 6 = 0), this turns to 
estimate the radius by : 




a 


R 


= E 9iVij ^9j 
ij 


(B . 23) 


This gives results close in quality to minimising Eq. (|B . 22|) , if the fixed center (a, b) = (0, 0) 
is correctly measured. If however, there is any bias in this estimate, it may become much worser. 
Indeed, in case when the measured center is slightly biased, minimising Eq. ([B . 22|) is safe 
as one recovers the correct center location, even starting from a wrong center ; instead, using 
Eq. (P . 23|) , the estimate of the radius suffers piling up effects which summarize in sensible 
effects. For instance, assuming 2.8 mr systematic error on the charged track direction (known 
with statistical accuracy 2 mr rms), the radius pull gets a rms of 1.12 instead of 0.98, under 
the same conditions, for the procedure we recommand. This difference of behaviour degrades 
in presence of background (1.4 compared to 1.2) or if the statistical accuracy on the charged 
track direction worsens. 


B5 Correlations among Photons in the Equatorial Plane 


The radius tan 0(7/2 of the circle is approximated by each of the photon distance to the 
common center (d' = + yf ). Each such estimate is affected by an error function which 

is given by Rel. (P . 21|) , with the measured center set at the origin in the equatorial plane. 
From expressions given above, we can then deduce : 


< > = 


< 

< 6di6dj > = 


(B . 24) 

where we have assumed that the measurements {x and y) for photons i and j are statistically 
independent for ease of reading. 

These relations give the expression for the elements of the error covariance matrix which 
enter the fit procedure described in the body of the text and just above. The variance for each 
estimate of the circle radius (d') depends on the photon errors, the track direction errors and 
the multiple scattering it undergoes ; the covariance term exhibits an interesting feature : up 
to the fact that the metric along v and w are different, one sees a surprising correlation pattern. 


1 


xl[< [5xif > >]+yi[< [^Vi? > T^sin^d < >] + 


2xiyi[< 6xi6yi > +- sind < 696cj) >] 


1 p L 

^ s'" E 


4d'd' 


XiXj < [dd]^ > +yiyjsm‘^9 < [d0]^ > + 


/ , ^ .1 , (xiXj+yiyj) 1 2 L 

[Xiyj + xjyi) Sind < 696 (j) >] +- — -—c — 
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Qualitatively, correlations are the strongest (and positive) for photons close to each other in 
azimuth, correlations are the strongest (and negative) for pairs of photons opposite in azimuth, 
while there is no correlation for photon pairs having azimuthal distance of 7r/2. 


B6 Multiple Scattering Effects in the General Case 

When the arc to be £t is large enough (possibly 27r radians), hxing the circle center becomes 
irrelevant. In this case, a question remains about the influence of multiple scattering on the 
error dehnition and the £t procedure. 

For each photon, di = ^{xi — a*)^ + (?/j — bi)^ remains the basic quantity which enters 
the £t procedure. Whatever is the way to express the problem, we have as free parameter 
the charged track direction and as “data” the angular distance of this direction with photon 
directions. As noted above, when multiple scattering is active, the direction of the charged 
track varies from photon to photon with theoretically known statistical fluctuations. Therefore 
the estimate of the radius provided by each photon inherits the fluctuations of the charged 
track. Stated otherwise, the error due to multiple scattering can either be treated separately 
(as we did) or included in the error function of the measurement {xi,yi), together with the 
other contributions (geometrical errors, chromaticity error). This means that Rel. ( |B . 21| ) is 
still relevant. Therefore, when there is no center hxing term, it is equivalent to consider that 
the error function on {xi, yi) is {5xi + Syt + 5hi) , where the second term of each component 
is reduced to only the multiple scattering contribution. 

In this case, Reis. ( [B . 241) become : 


< [ddif > 

< ddiSdj > 



x] < [5x^f > +y1 < [5yif > +2xiyi < SxiSyi > 




L 


{xjyj + xjyj) 1 2 _^ 

12^ Xo 


(B . 25) 


This shows that correlations among photons always exist, only due however to the properties 
of multiple scattering. Therefore, for low momentum tracks, when the multiple scattering is 
dominant, correlations among photons can never be ignored in any reconstruction procedure 
for devices like the DIRC. This is clearly independent of the representation chosen for the data 
(here the stereographic projection) ; it only relies on the fact that the measured quantities are 
angles between photons and a single charged track direction which changes in a correlated way 
from one photon to the other. 
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Figure 1: A schematic drawing illustrating the DIRC principle : photons produced in the quartz 
bar radiator are transported to the bar end because of total internal reflection. Photon angles 
are preserved till bar exit. 
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Figure 4: Distribution of S6 (radian units) for the mixed track sample with one additional 
reconstructed background track; in (a), S9 as measured from signal track; 69 in (b), as measured 
from background track. 
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Figure 5: In a), histogram of the proportion (in percents) of unambiguous photons relative to 
the total number of detected photons (all are signal photons) for each track before the photon 
recovering step. Non accepted events appear as a peak at zero. In b), histogram of the same 
quantity after the recovering procedure. In both cases no flat background and no additional 
tracks have been added. 
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Figure 6: Display of DIRC Cerenkov (solution) photons for one kaon track produced together 
with a low flat background (see text). An equatorial projection wrt the bar axis is used here. 
Top : all photon solutions surviving the primary selection cuts are displayed with the various 
reflected circles ; Bottom : same view, keeping only the circles associated with the original 
track direction. c-n 
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Figure 7: Display of DIRC Cerenkov (solution) photons for the event shown in Fig. Top : 
unambiguous photon solutions found by the algorithm ; all lay along the kaon circle. Bottom : 
all ambiguous photon solutions to be examined by the recovering procedure : they are spread 
along all possible circles. 
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Figure 8: Distributions obtained using Monte Carlo data generated without additional noise. 
In a) the probability distribution ; in b), the reconstructed Cerenkov angle pulls ; in c), the 
absolute biases (radians). Notice in b) and c) the smallness of the tails. 
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Figure 9: Estimated error behavior. In a) the estimated pull dispersion (rms) of reconstructed 
Cerenkov angle around the expected Cerenkov angle in various track momentum bins ; b) rms 
dispersion of the reconstructed Cerenkov angle (in radians) also in the same track momentum 
bins. Notice in a) the trend to go towards a pull of zero mean and unit rms, when going to 
higher and higher momenta ; this typically reflects the decreasing effect of mnltiple scattering. 
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Figure 10: Reconstructed Cerenkov angle pul 
The mean and rms are close to the expected 
the mean is slightly too large. 
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Figure 11; probabilities of pion and kaon identification. In a), pion hypothesis probability for 
true pion tracks ; in b), kaon hypothesis probability for true pion tracks ; in c), kaon hypothesis 
probability for true kaon tracks ; in d), pion hypothesis probability for true kaon tracks. 
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Figure 12: An example of the Cerenkov circle fit and the inter photon correlations effect when 
the errors on the charged track direction are changed. In a), the full track momentum range is 
used, whereas in b) only low momentum tracks are considered. Along the horizontal axis, the 
dispersion of the track polar angles in milliradian units (ue = a^p) ; along the vertical axis, the 
RMS dispersion of the pulls of the reconstructed Cerenkov angle. Square markers represent 
the pulls dispersion value when the correlations are neglected and the center is not fitted, cross 
markers show the same quantity when the center is htted ; hnally, round markers show the 
same quantity when correlations are taken into account and the center is htted. Fitting the 
center allows a partial account of correlations (see text). 
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Figure 13: Effect of inter photon correlations due to multiple scattering (the dispersion of track 
direction measurement is set to zero). The reconstructed Cerenkov angle pull rms is plotted as 
a function of the inverse track momentum : cross markers represent the pulls dispersion when 
no correlations are accounted for ; the round markers show the stability of the pulls when the 
correlations are correctly taken into account. 
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Figure 14: Influence of the errors {66, 6(j)) on the incoming track direction on the probability 
distribution when the inter photon correlations are not taken into account. In a) 69 = 6(j) = Q 
mr; inh), 69 = 6(j) = 1 mr; in c), 66 = 6(j) = 3 mr; in d), 69 = 6(j) = 4 mr; in e), 69 = 6(j) = 5 
mr. In f), the same quantity for 69 = 6(j) = 5 mr, when the correlations are accounted for. The 
adjustable cuts used are the same in each case. 
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Figure 15: Reconstructed Cerenkov angle pulls (in a) and biases (in b) for the mixed particle 
sample, one identified additional track has been superimposed as background. 











Figure 17: Effect of different types of background on the pull distributions of the reconstructed 
Cerenkov angle. In a), a flat random noise is superimposed to the signal track ; in b), a second 
track is mixed with the signal track ; in c) two additional tracks are superimposed to the signal 
track. The effect of noise seems more important in case a). 









Figure 18: probability distribution for the sample of mixed particles when using the same 

set of cuts ; in (a), only signal photons are considered ; in (b), a flat random photon noise has 
been superimposed to signal photons; in (c), photons from one additional track (considered as 
background) are superimposed to the original track photons. 
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Figure 19: Adaptation of cuts to background conditions, reflected in the per n^o/ distribu¬ 
tion for the sample of mixed particles when cuts have been correctly set ; in (a), only signal 
photons are considered ; in (b), a flat random photon noise has been superimposed to signal 
photons ; in (c), photons from one additional track (which existence is supposed to be known) 
are superimposed to the original track photons. Accordingly, the probability distributions for 
each of these cases are very close to be flat. 
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Figure 20; Effect of the \S6\ secondary cut and probability cut. The track momentum is 
plotted versus the identihed mass ; units are GeV/c and GeV. In (a), the scatter plot is made 
for generated pion tracks with one additional identified background track; in (b), same plot 
when the additional track is considered as unidentihed (i.e: no |(56'| secondary cut). In (c), \69\ 
secondary cut applied (as in (a)), but requiring additionally a probability greater than 0.01. 
In (d), same plot as in (c), but with a probability cut level of 0.03. 
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Figure 21: Effect of median cut on the probability distribution for the sample of mixed 
particles with no background of any type ; in (a), 1 a cut ; in (b), 2 u ; in (c), 3 a ; in (d), 4 a. 















Figure 22: Effect of the secondary cut. The probability distribution for the sample of 
mixed particles is shown with ; (a), 1 track background and secondary cut ; in (b), 1 track 
background and no |(56*| secondary cut ; in (c), 2 tracks background and |56*| secondary cut ; in 
(d), 2 tracks background and no |(50| secondary cut. Note that the other cuts levels are different 
in cases a) and c). 
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